我正在尝试读取包含 id 和序列的核苷酸序列文件。默认情况下,序列在 70 位核苷酸序列后由新行分隔。
输入文件 (seq.txt) 看起来像这样。
seqgb_AY741213_Organism_Influenza_A_virus__A_blackbird_Hunan_1_2004_H5N1___Strain_Name_A_blackbird_Hunan_1_2004_Segment_4_Subtype_H5N1_Host_Blackbird,
ATGGAGAAAATAGTGCTTCTTCTTGCAATAGTCAGTCTTGTTAAAAGTGATCAGATTTGCATTGGTTACC
ATGCAAACAACTCGACAGAGCAGGTTGACACAATAATGGAAAAGAACGTTACTGTTACACATGCTCAAGA
CGTACTGGACAAGACACACAACGGGAACACTCAGTTTGAGGCCGTTGGAAGGGAATTTAATAACTTAGAA
AGGAGAATAGAAAATTTAAACAAGAAGATGGAGGACGGATTCCTAGATGTCTGGACTTATAATGCTGAAC
TTCTGGTTCTCATGGAAAATGAGAGAACTCTAGACTTTCATGACTCAAATGTCAAGAACCTTTACGAAAA
GGTCCGACTACAACTTAGGGATAATGCAAAGGAGCTGGGTAACGGTTGTTTCGAGTTCTATCACAAATGT
GATAATGAATGTATGGAAAGTGTAAGAAACGGAACGTATGACTACCCGCAGTATTCAGAAGAAGCAAGAC
TAAACAGAGAGGAAATAAGTGGAGTAAAATTGGAATCAATAGGAACTTACCAAATACTGTCAATTTATTC
AACAGTGGCGAGTTCCCTAGCACTGGCAATCATGGTAGCTGGTCTATCTTTATGGATGTGCTCCAATGGA
TCGTTACAATGCAGAATTTGCATTTGA
seqgb_EU676325_Organism_Influenza_A_virus__A_brown-head_gull_Thailand_vsmu-4_2008_H5N1___Strain_Name_A_brown-head_gull_Thailand_vsmu-4_2008_Segment_4_Subtype_H5N1_Host_Brown-Headed_Gull,
TTTAGCAAAAGGCAGGGGTATATCTGTCAAAATGGAGAAAATAGTGCTTCTTTTTGCAATAGTCAGTCTT
GTTAAAAGTGATCAGATTTGCATTGGTTACCATGCAAACAACTCGACAGAGCAGGTTGACACAATAATGG
AAAAGAACGTTACTGTTACACATGCCCAAGACATACTGGAAAAGACACACAACGGGAAGCTCTGCGATCT
AGATGGAGTGAAGCCTCTAATTTTGAGAGATTGTAGTGTAGCTGGATGGCTCCTCGGAAACCCAATGTGT
GACGAATCTCCAATGGGGGCGATAAACTCTAGTATGCCATTCCACAATATACACCCTCTCACCATCGGGG
AATGCCCCAAATATGTGAAATCAAACAGATTAGTCCTTGCGACTGGGCTCAGAAATAGCCCTCAAAGAGA
GAGAAGAAGAAAAAAGAGAGGATTATTTGGAGCTATAGCAGGTTTTATAGAGGGAGGATGGCAGGGAATG
GTAGATGGTTGGTATGGGTACCACCATGAACTTCTGGTTCTCATGGAAAATGAGAGAACTCTAGACTTTC
ATGACTCAAATGTCAAGAACCTTTACGACAAGGTCCGACTACAGCTTAGGGATAATGCAAAGGAGCTGGG
TAACGGTTGTTTCGAGTTCTATCATAAATGTGATAATGAATGTATGGAAAGTGTAAGAAACGGAACGTAT
GACTACCCACAGTATTCAGAAGAAGCAAGACTAAAAAGAGAGGAAATAAGTGGAGTAAAATTGGAATCAA
TAGGAATTTACCAAATACTGTCAATTTATTCTACAGTGGCGAGTTCCCTAGCACTGGCAATCATGGTAGC
TGGTCTATCCTTATGGATGTGCTCCAATGGGTCGTTACAATGCAGAATTTGCATTTAAATTTGTGAGTTC
AGATTGAG
seqgb_EF178528_Organism_Influenza_A_virus__A_brown-headed_gull_Thailand_VSMU-28-SPK_2005_H5N1___Strain_Name_A_brown-headed_gull_Thailand_VSMU-28-SPK_2005_Segment_4_Subtype_H5N1_Host_Brown-Headed_Gull,
AGCAAAAGCAGGGGTATAATCTGTCAAAATGGAGAAAATAGTGCTTCTTTTTGCAATAGTCAGTCTTGTT
AAAAGTGATCAGATTTGCATTGGTTACCATGCAAACAACTCGACAGAGCAGGTTGACACAATAATGGAAA
AGAACGTTACGAATGATGCAATCAACTTCGAGAGTAATGGAAATTTCATTGCTCCAGAGTATGCATACAA
AATTGTCAAGAAAGGGGACTCAACAATTATGAAAAGTGAATTGGAATATGGTAACTGCAACACCAAGTGT
CAAACTCCAATGGGGGCGATAAACTCAAGGTCAACTCGATCATTGACAAAATGAACACTCAGTTTGAGGC
CGTTGGAAGGGAATTTAACAACTTAGAAAGGAGAATAGAGAATTTAAACAAGAAGATGGAAGACGGGTTC
CTAGATGTCTGGACTTATAATGCTGAACTTCTGGTTCTCCTGGAAAATGAGAGAACTCTAGACTTTCATG
ACTCAAATGTCAAGAACCTTTACGACAAGGTCCGACTACAGCTTAGGGATAATGCAAAGGAGCTGGGTAA
CGGTTGTTTCGAGTTCTATCATAAATGTGATAATGAATGTATGGAAAGTGTAAGAAACGGAACGTATGAC
TACCCACAGTATTCAGAAGAAGCAAGACTAAAAAGAGAGGAAATAAGTGGAGTAAAATTGGAATCAATAG
GAATTTACCAAATACTGTCAATTTATTCTACAGTGGCGAGTTCCCTAGCACTGGCAATCATGGTAGCTGG
TCTATCCTTATGGATGTGCTCCAATGGGTCGTTACAATGCAGAATTTGCATTTAAATTTGTGAGTTCAGA
T
seqgb_CY091790_Organism_Influenza_A_virus__A_chicken_Ampenan_BBVD-282_2007_H5N1___Strain_Name_A_chicken_Ampenan_BBVD-282_2007_Segment_4_Subtype_H5N1_Host_Chicken,
TCAATCTGTCAAAATGGAGAAAATAGTGCTTCTTCTTGCAATAGTCAGTCTTGTTAAAAGTGATCAGATT
TGCATTGGTTACCATGCAAACAATTCAACAGAGCAGGTTGACACAATAATGGAAAAGAACGTTACTGTTA
CACATGCCCAAGACATACTGGAAAAGGGAAAATGAGAGAACTCTAGACTTTCATGACTCAAATGTTAAGA
ACCTCTACGACAAGGTCCGACTACAGCTTAGGGATAATGCAAAGGAGCTGGGTAACGGTTGTTTCGAGTT
CTATCACAAATGTGATAATGAATGTATGGAAAGTATAAGAAACGGAACGTATAACTACCCGCAGTATTCA
GAAGAAGCAAGATTAAAAAGAGAGGAAATAAGTGGAGTAAAATTGGAATCAATAGGAACTTACCAAATAC
TGTCGATTTATTCAACAGTGGCGAGTTCCCTAGCACTGGCAATCATGATGGCTGGTCTATCTTTATGGAT
GTGCTCCAATGGATCGTTACAATGCAGAATTTGCATTTAAATTTGTGAGTTCAGATTGTAGTTAAA
seqgb_KT216634_Organism_Influenza_A_virus__A_chicken_Anhui_MG08_2008_H9N2___Strain_Name_A_chicken_Anhui_MG08_2008_Segment_4_Subtype_H9N2_Host_Chicken,
AGCAAAAGCAGGGGAATTTCACAACCACTCAAGATGGAGACAGTATCACTAATAAATATACTACTAGTAG
TAACAGTAAGCAATGCAGATAAAATCTGCATCGGCTATCAATCAACAAATTCCACAGAAACTGTAGACAC
ACTAACAGAAAACAATGTCCCTGTGATTGTAATTGCAATGGGGTTTGCTGCCTTCTTGTTCTGGGCCATG
TCCAATGGGTCTTGCAGATGCAACATTTGTATATAATTGGCAAAAACACCCTTGTTTCTACT
seqgb_KY005855_Organism_Influenza_A_virus__A_chicken_Anhui_MZ33_2016_H5N6___Strain_Name_A_chicken_Anhui_MZ33_2016_Segment_4_Subtype_H5N6_Host_Chicken,
ATGGAGAAAATAGTGCTTCTTCTTGCAGTGGTTAGCCTTGTTAAAGGTGATCAGATTTGCATTGGTTACC
ATGCAAACAACTCGACTGAGCAGGTTGACACGATAATGGAAAAAAACGTCACTGTTACACATGCTCAAGA
CATACTAGAAAGGAATATGGCAATTGCAACACCAAATGTCAAACTCCAATAGGGGCGATAAACTCTAGTA
TGCCATTCCACAATATACACCCTCTCACTATCGGGGAGTGCCCCAAATATGTGAAATCAAACAAATTAGT
CCTTGCGACTGGGCTCAGAAATAGTCGAATCCACCCAAAAGGCAATAGATGGAGTTACCAATAAGGTCAA
CTCGATAATTGACAAAATGAACACTCAGACGGATTCCTAGATGTCTGGACTTATAATGCTGAACTTTTAG
TTCTCATGGAAAATGAGAGAACTCTAGATTTCCATGACTCAAATGTCAAGAACCTTTATGACAAAGTCCG
ACTACAGCTTAGGGATAATGCAAAGGAGCTGGGTAATGGTTGTTTCGAGTTCTATCACAAATGTGATAAT
GAATGTATGGAAAGTGTGAGGAATGGGACGTATGACTACCCCCAGTATTCAGAAGAAGCAAGATTAAAAA
GGGAAGAAATAAGCGGAGTGAAATTGGAATCAATAGGAACTTACCAAATACTGTCAATTTATTCAACAGT
GGCGGGTTCCCTAGCACTGGCAATCATTGTGGCTGGTCTATCTTTATGGATGTGCTCCAATGGGTCGTTA
CAATGCAGAATTTGCATTTAA
seqgb_KY005863_Organism_Influenza_A_virus__A_chicken_Anhui_MZ34_2016_H5N6___Strain_Name_A_chicken_Anhui_MZ34_2016_Segment_4_Subtype_H5N6_Host_Chicken,
ATGGAGAAAAGAAGAACGATGCATACCCAACAATAAAAATGAGCTACAATAACACCAATAGGGAAGATCT
TTTGATACTGTGGGGGATTCATCATTCCAATAATGCAGAAGAGCAGACAAATCTCTATAAAAACCCAACC
ACCTATGTTTCCGTTGGGACATCAACATTAAACCAGAGAGTGGTGCCAAAAATAGCTACTAGATCCCAAG
TAAACGGGCAAAGTGGAAGAATGGATTTCTTCTGGACAATTTTAAAACCGGATGATGCAATCCACTTCGA …Run Code Online (Sandbox Code Playgroud)