ImageVerifierCode 换一换
格式:PPT , 页数:43 ,大小:4.17MB ,
资源ID:3719502      下载积分:12 文钱
快捷下载
登录下载
邮箱/手机:
温馨提示:
快捷下载时,用户名和密码都是您填写的邮箱或者手机号,方便查询和重复下载(系统自动生成)。 如填写123,账号就是123,密码也是123。
特别说明:
请自助下载,系统不会自动发送文件的哦; 如果您已付费,想二次下载,请登录后访问:我的下载记录
支付方式: 支付宝    微信支付   
验证码:   换一换

加入VIP,省得不是一点点
 

温馨提示:由于个人手机设置不同,如果发现不能下载,请复制以下地址【https://www.wenke99.com/d-3719502.html】到电脑端继续下载(重复下载不扣费)。

已注册用户请登录:
账号:
密码:
验证码:   换一换
  忘记密码?
三方登录: QQ登录   微博登录 

下载须知

1: 本站所有资源如无特殊说明,都需要本地电脑安装OFFICE2007和PDF阅读器。
2: 试题试卷类文档,如果标题没有明确说明有答案则都视为没有答案,请知晓。
3: 文件的所有权益归上传用户所有。
4. 未经权益所有人同意不得将文件中的内容挪作商业或盈利用途。
5. 本站仅提供交流平台,并不能对任何下载内容负责。
6. 下载文件中如有侵权或不适当内容,请与我们联系,我们立即纠正。
7. 本站不保证下载资源的准确性、安全性和完整性, 同时也不承担用户因使用这些下载资源对自己和他人造成任何形式的伤害或损失。

版权提示 | 免责声明

本文(二代测序实验与测序原理.ppt)为本站会员(99****p)主动上传,文客久久仅提供信息存储空间,仅对用户上传内容的表现方式做保护处理,对上载内容本身不做任何修改或编辑。 若此文所含内容侵犯了您的版权或隐私,请立即通知文客久久(发送邮件至hr@wenke99.com或直接QQ联系客服),我们立即给予删除!

二代测序实验与测序原理.ppt

1、二代测序的建库与测序原理,何有裕上海生物信息技术研究中心上海众信生物技术有限公司苏州众信生物技术有限公司,内容,样本处理与测序原理简介罗氏454Illumina solexa原始数据质量控制,TruSeq RNA and DNA Sample Preparation,Cluster Generation Overview, 1000-6000 molecules per cluster,OH,Cluster Generation, Template Hybridization,diol,diol,1st cycle denaturation,Cluster Generation, Bridge

2、 PCR,Template preparation-bridge RCR,Adaptor ligation,Surface attachment,Bridge amplification,Denaturation,Trends in Genet 24:133(2008),First base incorporated,Cycle 1: Add sequencing reagents,Detect Signal,Cleave Terminator and Dye,Cycle 2-n: Add sequencing reagentsand repeat,Sequencing by Synthesi

3、s Overview,Cyclic reversible termination,All four labeled reversible terminators are added per cycleRemove unincorporated bases and detect signalRemove the terminating group and the fluorescent dye,Trends in Genet 24:133(2008),Terminating group,Fluorophore cleavage,Nat Rev Genet 11:31(2010),Base cal

4、ling,Flowcell layout on GAII,A flow cell contains 8 lanes,Lane 1,Lane 2,Lane 8,.,Column 1Column 2,Each lane contains 2 columns,Each column contains 60 tiles,Each tile is imaged 4 times per cycle,Primary Data Analysis By Firecrest and Bustard in RTA/OLB,tiff image file,Intensity file,Firecrest,Bustar

5、d,Sequence file,OH,diol,diol,OH,Cluster Generation, Sequencing Primer Hybridization(Single测序方式处理步骤),Sequence multiple samples in the same lanes,DNA insert,Read 1,Index Read,Read 2,DNA insert,Index,Index SP,Rd2 SP,Rd1 SP,Multiplexing multiple samples in the same lanes,Pair-end 测序优势,Mate-pair 建库和测序,Mo

6、lecular Ecology Resources (2011),Template preparation- emulsion PCR,Trends in Genet 24:133(2008),Pyrosequencing,Single dNTP type flows per cycleInorganic pyrophosphate (PPi) drives visible light through a series of reactionsRemove unincorporated nucleotide,Trends in Genet 24:133(2008),Base calling,H

7、omopolymer error,GV6330,20,灵活的多样本标签技术,454、solexa测序模式,Detect H+ released as a voltage changefast Common microchip design standardslow-cost manufacturingSequencing volume is increasing,Semiconductor sequencing,Fasta序列格式,Fastq 文件用4行记录一条序列,第一行以字符开头,跟在后面的是序列标识和描述 第二行是序列字符 第三行以+字符开头,后面可以为空,或者和第一行一样 第四行是第二

8、行序列质量数据的编码,长度需和第二行一样,HWI-ST507:211:C18E6ACXX:2:1101:1688:1992 1:N:0:GAGTGGCGACAATTTTTTTTGATATTAATAAAGATAGAACTTTCTTCCTATGAGTTTTCTCTC+CCCFFDFFHHHHGJJGHIIJGIIJJJJIIJJHJJJJJIJJIIIGIIIJGGIHJDIJIGAHEHFFGHGHE,Example:,Illumina sequence identifiers,HWI-EAS364_0004:4:1:995:9044#0/1,Casava 1.8以前的序列标识,Illumina

9、 sequence identifiers,HWI-ST507:211:C18E6ACXX:2:1101:1688:1992 1:N:0:GAGTGG,Casava 1.8的序列标识,序列质量,附:Solexa 1.3以前的quality计算公式是:,SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSS. .XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX.IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII.JJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJ.

10、LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLL. !#$%?ABCDEFGHIJKLMNOPQRSTUVWXYZ_abcdefghijklmnopqr| | | | |33 59 64 73 1040.26.31.40 -5.0.9.40 0.9.40 3.9.40 0.26.31.41 S - Sanger Phred+33, raw reads typically (0, 40) X - Solexa Solexa+64, raw reads typically (-5, 40) I - Illumina 1.3+ Phred+64, raw read

11、s typically (0, 40) J - Illumina 1.5+ Phred+64, raw reads typically (3, 40) with 0=unused, 1=unused, 2=Read Segment Quality Control Indicator (bold) (Note: See discussion above). L - Illumina 1.8+ Phred+33, raw reads typically (0, 41),Q值对应ASCII码,454原始数据图片、sff格式、fasta格式(qual),HSAPGDX01D1KDA length=18

12、1 xy=1540_3788 region=1 run=R_2012_08_01_00_39_39ACGTGTTCTGAGCCATATTGCGGTACTGGAAGGTGCGCCTGCACTGTCTGAGCACTGGTCACTGCTCGATACCAATGAAGCCTTATTTGATGAGGCGCGCACCACGCAGGCGGCGACTATTATCTTCTCGTTTGATCCAGAATAACCAAATCGAAAACGCTGGCAAGGCACACAGGGGATAHSAPGDX01D1KDA length=181 xy=1540_3788 region=1 run=R_2012_08_01_00_39

13、_3940 40 40 40 40 40 40 39 37 38 36 34 24 23 19 19 19 24 20 19 18 18 26 26 18 18 19 18 20 20 20 25 25 26 19 20 20 22 22 22 25 28 26 24 22 22 22 25 24 28 28 28 29 29 28 30 30 30 26 2626 27 27 27 31 31 30 28 28 28 30 30 30 30 26 21 21 20 20 26 27 28 24 25 20 20 20 20 19 19 19 27 28 28 30 30 31 30 28 2

14、8 30 31 31 32 32 31 31 30 30 30 31 27 24 24 22 20 20 20 22 2626 22 22 23 16 16 16 19 22 16 13 13 13 16 22 23 23 23 26 26 24 24 26 13 13 11 11 12 12 19 22 18 18 11 11 13 13 18 24 24 24 24 26 26 26 27 29 29 31 33 32 31 31 27 27 27 29 29 28 2622,454原始数据长度分布(质控后一样),Yield, data size produced by sequencer

15、.Reads, sequenced fragments.Read length and quality.Coverage fold, number of times a nucleotide is represented. Depth, the average coverage fold.Coverage rate, ratio of the region sequenced to the whole genome.Homopolymer, e.g. AAAAA,Key lab of systems biologySIBS, Chinese Academy of Sciences,一些测序中提

16、到的基本概念,通常深度测序数据处理流程,Key lab of systems biologySIBS, Chinese Academy of Sciences,序列质量评估, FastQC: A quality control tool for high throughput sequence data Java http:/www.bioinformatics.bbsrc.ac.uk/projects/fastqc/ Function:,QC pipeline,原始数据的质控过滤,Sequence level Short sequences Adaptor/primer polyA | T

17、region Overall low-complexity sequence (Dust) Contamination/unwanted sequences Ns (low quality ends) Quality level Low quality base or region 目标:所有保留的都是高质量的,真正参与生物信息分析的数据。,Clean reads,去掉含有接头序列的reads;当单端测序read中含有的N的含量超过该条read长度比例的10% 时,去除此对paired reads;当单端测序read中含有的低质量(低于5)碱基数超过该条read长度比例的50% 时,需要去除此对paired reads。,Reads中不合格的碱 基判断标准:reads中出现N, 记个数reads中碱基质量分数低于20分, 记个数去除的reads条件:质 量不合格的碱基占reads长度的10%以 上(即10bp)没 有3接 头的reads5接头污染的reads没 有插入判断的reads,

Copyright © 2018-2021 Wenke99.com All rights reserved

工信部备案号浙ICP备20026746号-2  

公安局备案号:浙公网安备33038302330469号

本站为C2C交文档易平台,即用户上传的文档直接卖给下载用户,本站只是网络服务中间平台,所有原创文档下载所得归上传人所有,若您发现上传作品侵犯了您的权利,请立刻联系网站客服并提供证据,平台将在3个工作日内予以改正。