随笔分类 - [25] 生物信息学
计算生物信息学,计算生物学,生物统计学,生物数学,分子生物学,遗传学,医学遗传学
摘要:在线序列转换的工具: Nucleotide Sequence Translation Protein Sequence Back-translation http://bio.lundberg.gu.se/edu/translat.html http://www.ebi.ac.uk/Tools/st
阅读全文
摘要:使用tophat和cufflinks计算RNA-seq数据的表达水平时,当一个基因在一个样本中有多个表达水平时需要合并它们的表达水平。 This code is a solution to collapsing duplicate FPKMs for a gene. CollapseFPKM Col
阅读全文
摘要:在SAM输出的结果中每一行都包括十二项通过Tab分隔(\t),从左到右分别是: 1 QNAME,序列的名字(Read的名字) 2 FLAG, 概括出一个合适的标记,各个数字分别代表 1 序列是一对序列中的一个 2 比对结果是一个pair-end比对的末端 4 没有找到位点 8 这个序列是pair中的
阅读全文
摘要:Cole Trapnellsaid:there are three strategies:1) merge bams and assemble in a single run of Cufflinks2) assemble each bam and cuffcompare them to get a...
阅读全文
摘要:Lift genome positionsGenome positions are best represented inBED format. UCSC provides tools to convert BED file from one genome assembly to another.B...
阅读全文
摘要:library(GenomicAlignments)
阅读全文
摘要:samtools flagstat /SRA111111/SRR111222/accepted_hits.bam 78406056 + 0 in total (QC-passed reads + QC-failed reads) (1)0 + 0 duplicates78406056 + 0 map
阅读全文
摘要:mycount=`cat SRR108114_new_1.fastq | wc -l` echo 'Number of clean reads, SRR108114_new_1.fastq: '$((mycount/4)) >> SRR108114.stat mycount=`cat SRR1081...
阅读全文
摘要:Systematic LncRNA ClassificationFrom: http://www.arraystar.com/Services/Services_main.asp?ID=307Analyzing the genomic context of LncRNAs can help pred...
阅读全文
摘要:-------------------------------Long noncoding RNAs are rarely translated in two human cell lines.(蛋白,多肽)Bánfai B, Jia H, Khatun J, Wood E, Risk B, Gun...
阅读全文
摘要:长链非编码RNA(lncRNA)转自:http://blog.sina.com.cn/s/blog_909da11301010bkz.html长链非编码RNA(lncRNA)是一类转录本长度超过200nt的RNA分子,它们并不编码蛋白,而是以RNA的形式在多种层面上(表观遗传调控、转录调控以及转录后...
阅读全文
摘要:转自:http://blog.sina.com.cn/s/blog_8088f3700101pab7.html权威发布:长链非编码RNA命名规则对于人类基因命名标准的制定而言,雨果基因命名委员会(HGNC)是唯一官方授权的机构。HGNC的数据库中有38000个基因名称,其中大部分是编码蛋白基因;但H...
阅读全文
摘要:GBrowse配置相关资料(形状、颜色、配置、gff3)http://gmod.org/wiki/Glyphs_and_Glyph_Optionshttp://gmod.org/wiki/GBrowse_Configuration/Glyphshttp://gmod.org/wiki/GBrowse...
阅读全文
摘要:[1] bedtools (https://github.com/arq5x/bedtools2)here is also bedtools (https://github.com/arq5x/bedtools2) getfasta. It uses Erik's code under the ho...
阅读全文
摘要:Use a regular expression for filtering sequences by id from a FASTA file, e.g. just certain chromosomes from a genome. There are other toolsas part of...
阅读全文
摘要:bgzip – Block compression/decompression utilitytabix – Generic indexer for TAB-delimited genome position filesSAMtoolsandBCFtoolsare distributed as in...
阅读全文
摘要:VCF文件示例(VCFv4.2)##fileformat=VCFv4.2##fileDate=20090805##source=myImputationProgramV3.1##reference=file:///seq/references/1000GenomesPilot-NCBI36.fast...
阅读全文
摘要:R, BioconductorfilterVcf: Extract Variants of Interest from a Large VCF File (Paul Shannon)We demonstrate three methods: filtering by genomic region, ...
阅读全文
摘要:The C++ executable module examples This page provides usage examples for the executable module. Extended documentation for all of the options can be f
阅读全文
摘要:如果是Linux系统,随意打开一个终端;如果用的是Windows系统,那么打开命令提示符。 输入以下命令: perldoc Bio::SeqIO 以上命令的作用是查看Bio::SeqIO模块的文档是否存在,如果存在,则会有相应的文档输出,则安装了Bio::SeqIO模块;如果没有,说明Perl找不到
阅读全文