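Can anyone help me? I need a regular expression that matches only lines containing the characters ADFHKMPRTWCEGILNQSVY and NOTHING else.
I need to loop over lines of text that look like this: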
>gi|46450118|gb|AAS96767.1| femAB family protein [Desulfovibrio vulgaris str. Hildenborough]
MVDLSRKKTQALLPTDILFQTPYWAQVKTRLGMESHAFDIRSSGPWGDVLVLLRRFGRHRVAIVPQGPEV
APPHEDYGVYLESFSLALAEGLGPDVAFIRYDLPWVSPYADEMHDEGWNAFPEARLRELRMNMGTRHWNL
RKSFQDLTVASSLVVDITGEEAAVLERMKPKTRYNIGLARRKGVAVREVGRESLPQFHALYRQTAIRNGF
EPCSITHFSAMFHALCDGAGSTELLFLLATHGTDILAGCIVGLAGRTANFLYGASGNVKRNLMAPYLMHW
TAMCHARDRGCHDYEMGAVPPGHDPAHPFHGLYRFKTGFGGRVALRSGSWDYPLDHAAYRDFCNAESLYR
TDAAPGRTQ
>gi|46450117|gb|AAS96766.1| iron-sulfur protein CooF [Desulfovibrio vulgaris str. Hildenborough]
MNHEELFVIQAEAEKCRACRKCELACIASHNNLTIKEAAKKRTVFAPRVHVVKTDEVKMPVQCRQCKDAP
CARVCPTRALVQDDGVVTMRAQFCAACRLCIMACPYGAISLSFIGLPEEDEAGAMHGREVAVRCDLCSEW
RAREGKSSCACVEACPTKALHMVPLAEARGRHQ
>gi|46450116|gb|AAS96765.1| hydrogenase nickel insertion protein HypA [Desulfovibrio vulgaris str. Hildenborough]
MHEASIVAGIMRIVEEEAARHDVTRIARVRLRVGLLTGVEPRTLTACFELYSEGTVAEGASLDLETVPAL
GTCHACGATFDLHRRCFACPTCGNDDITLEGGRELTIAGLEVPQPEGATA
>gi|46450115|gb|AAS96764.1| carbon monoxide-induced hydrogenase CooH, putative [Desulfovibrio vulgaris str. Hildenborough]
MSTPDSTTQTWTLPVGPLHVALEEPMYFKLDVDGEIVRNVEITAGHVHRGMEALAMRRNLFQNIVLTERV
CSLCSNSHPFTYCMAVEHLAGIEVPARADHLRVVAEEIKRTASHLFNVAILAHIIGFKSLFMHVMEVREI
MQDIKETVYGNRMDLAANCIGGVKYDVDAELLAMLLAGLDKVERNAREIYRIYASDPMVTGRTTGIGVLP
PDEARRFGVVGPVARGSGLAVDVRRDVPYAAYPQLSFDVITEEGCDVRARALVRLREVFESISIIRQCVA
TLPEGAMTVIMPEIPAGQSVARSEAPRGELMYYLRTDGTDIPNRLKWRVPSYMNWDALGVMMRDANVADI
PLIVNSIDPCISCTER
>gi|46450114|gb|AAS96763.1| hydrogenase, CooU subunit, putative [Desulfovibrio vulgaris str. Hildenborough]
MPDNALTAPLATALDALAEAEGFTWTRDAHGNAYGWLRLAERDTLPEAARLLAEGGARLATVTAYDPVRE
PGVPRQEIAYHFDVHGTTLTVTVVLDPECPSVPSITPHFRNADWNEREFMEMYDIAVPGHPNPRRLFLDE
KLDAGIMNTIIPLSTMTNGASTQNLWERILAARPGDKA
>gi|46450113|gb|AAS96762.1| hydrogenase, CooX subunit, putative [Desulfovibrio vulgaris str. Hildenborough]
MFGFLKVLARNVLKGPSTDPFPFAEAHTPARFRGQVRLDPALCVGCAICHHVCAGGAINIAEREDGSGYD
FTVWHNTCALCGLCRHYCPTGAITLSNDWHNAHLQSQKYDWCERQFVPFMQCEGCGAHIRPLPPQLAARA
YGPGGFDFASFMRLCPSCRQLAAARADVHIPEASAMPAAPAGHADEPAIREGDATAVTVKGDETPATGVQ
Q
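The header lines all start with >, so I could look for that. But I want to be sure I'm getting the right lines, so I'd also like a regex that matches the lines containing only ADFHKMPRTWCEGILNQSVY.
Cheers,
Stefan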
Something like this:
/^[ADFHKMPRTWCEGILNQSVY]+$/
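The question doesn't say which language the loop is written in, so here is a minimal Python sketch of how that pattern could be applied line by line. The names `RESIDUES`, `sequence_lines` and `sample` are made up for illustration. `re.fullmatch` is used in place of the explicit `^` and `$` anchors, and the line ending is stripped before matching.

```python
import re

# Same character class as the pattern above; fullmatch() requires the
# whole string to match, which plays the role of the ^ and $ anchors.
RESIDUES = re.compile(r"[ADFHKMPRTWCEGILNQSVY]+")


def sequence_lines(lines):
    """Yield only the lines made up entirely of the allowed letters."""
    for line in lines:
        stripped = line.rstrip("\r\n")  # drop the line ending before matching
        if RESIDUES.fullmatch(stripped):
            yield stripped


# Hypothetical sample: one header line and two sequence lines taken from the
# question, plus an empty line and a line with a space that should be rejected.
sample = [
    ">gi|46450118|gb|AAS96767.1| femAB family protein [Desulfovibrio vulgaris str. Hildenborough]\n",
    "TDAAPGRTQ\n",
    "Q\n",
    "\n",
    "MVDLSRKK TQALL\n",
]

matched = list(sequence_lines(sample))
print(matched)
```

Because of the `+`, empty lines are skipped, and the match is case-sensitive, so lowercase sequence data would need the `re.IGNORECASE` flag or an upper-cased copy of the line.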