连续两行fastq、连续两行MD5值如何转换为每行一个fastq一个MD5格式

 

001、shell实现

(base) [b20223040323@admin2 test]$ ls
a.txt
(base) [b20223040323@admin2 test]$ cat a.txt      ## 测试数据
SRR5534377_1.fastq.gz
SRR5534377_2.fastq.gz
d27d0b0f0bb9cae5dc52dc934384699b
1139b32cfc69010f2165fa0fa82be6bc
SRR5534400_1.fastq.gz
SRR5534400_2.fastq.gz
fe95ce69fdb56520da8ef53d27bb19cb
5e3a831991d9b4e642d02e73b69677ff
SRR5534403_1.fastq.gz
SRR5534403_2.fastq.gz
444d55c9dae6e0d357fd9cb92461fdc9
978bf203ef0ef85a673e454ebed6794f
(base) [b20223040323@admin2 test]$ awk 'NR % 4 == 1 {a=$0} NR % 4 == 2 {b = $0} NR % 4 == 3 {print a, $0} NR % 4 == 0{print b, $0}' a.txt   ## 根据行号记录和输出特定行
SRR5534377_1.fastq.gz d27d0b0f0bb9cae5dc52dc934384699b
SRR5534377_2.fastq.gz 1139b32cfc69010f2165fa0fa82be6bc
SRR5534400_1.fastq.gz fe95ce69fdb56520da8ef53d27bb19cb
SRR5534400_2.fastq.gz 5e3a831991d9b4e642d02e73b69677ff
SRR5534403_1.fastq.gz 444d55c9dae6e0d357fd9cb92461fdc9
SRR5534403_2.fastq.gz 978bf203ef0ef85a673e454ebed6794f

image

 。

 

002、python

(base) [b20223040323@admin2 test]$ ls
a.txt  test.py
(base) [b20223040323@admin2 test]$ cat a.txt
SRR5534377_1.fastq.gz
SRR5534377_2.fastq.gz
d27d0b0f0bb9cae5dc52dc934384699b
1139b32cfc69010f2165fa0fa82be6bc
SRR5534400_1.fastq.gz
SRR5534400_2.fastq.gz
fe95ce69fdb56520da8ef53d27bb19cb
5e3a831991d9b4e642d02e73b69677ff
SRR5534403_1.fastq.gz
SRR5534403_2.fastq.gz
444d55c9dae6e0d357fd9cb92461fdc9
978bf203ef0ef85a673e454ebed6794f
(base) [b20223040323@admin2 test]$ cat test.py
#!/usr/bin/env python
# -*- coding:utf-8 -*-
input_file = "a.txt"
output_file = "out.txt"
list1 = []
with open(input_file, "r") as f:
        for line in f:
                line = line.strip()
                if line:
                        list1.append(line)
with open(output_file, "w") as f:
        for i in range(0,len(list1),4):
                fq1,fq2,md51,md52 = list1[i:i+4]
                f.write(f"{fq1}\t{md51}\n")
                f.write(f"{fq2}\t{md52}\n")
(base) [b20223040323@admin2 test]$ python test.py
(base) [b20223040323@admin2 test]$ ls
a.txt  out.txt  test.py
(base) [b20223040323@admin2 test]$ cat out.txt
SRR5534377_1.fastq.gz   d27d0b0f0bb9cae5dc52dc934384699b
SRR5534377_2.fastq.gz   1139b32cfc69010f2165fa0fa82be6bc
SRR5534400_1.fastq.gz   fe95ce69fdb56520da8ef53d27bb19cb
SRR5534400_2.fastq.gz   5e3a831991d9b4e642d02e73b69677ff
SRR5534403_1.fastq.gz   444d55c9dae6e0d357fd9cb92461fdc9
SRR5534403_2.fastq.gz   978bf203ef0ef85a673e454ebed6794f

image

 。

 

posted @ 2025-10-18 22:50  小鲨鱼2018  阅读(1)  评论(0)    收藏  举报