Monarch geneset OGS2.0

DPOGS207432
TranscriptDPOGS207432-TA3627 bp
ProteinDPOGS207432-PA1208 aa
Genomic positionDPSCF300087 + 513603-518140
RNAseq coverage1215x (Rank: top 10%)
Annotation
HeliconiusHMEL0156200.069.60% 
BombyxBGIBMGA009332-TA0.064.57% 
Drosophila% 
EBI UniRef50UniRef50_UPI00020627A89e-1648.65%UPI00020627A8 related cluster n=1 Tax=unknown RepID=UPI00020627A8
NCBI RefSeqXP_002424198.12e-0839.76%glutamic acid-rich protein precursor, putative [Pediculus humanus corporis]
NCBI nr blastpgi|3287242363e-1548.65%PREDICTED: hypothetical protein LOC100572668 [Acyrthosiphon pisum]
NCBI nr blastxgi|1571069703e-2222.41%hypothetical protein AaeL_AAEL004658 [Aedes aegypti]
Group
KEGG pathway 
Orthology groupMCL25600 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207432-TA
ATGATGTTATATGTGACCAGTAATAGTGATGTGGTCCGTCGACCGCCTCGGAGGCGACACATCAGCGCCAACATTAAGCGTCATCATGCCGGTATATCCCAGTCCTCGTGCAGTGACGGCTCTCTGCTCAGTGTGGGCTCCTCGGAGATGGATGACGACAGCAGCAGTGGACACCAGCTGGACCAGACACGACACGAGCACTCAGAAGTCTACACATCGTCCGAGCCTCCCTCTGGTGTGGCGCCACTATCGCACTCAGCCGCTCGACACAAGATGGCGGTGAGACCGCGACGCAACCACGCTTTGCCACGGAGGAAAAAAAACAACTCGATCGCTGCCTCAGCCTTGCCGATAACTCCCGAGCTGAACGAAGAAATGACACGAAGCACTACTCCTGAAGTCATCCTCAAATCCTCAGAAGTCGTTACTGAATCATTCTCTTCGACTACTGCGAAGCATCACATGATCGTTAAGGACCAACACCAACAGCTGAACCGGGAACTCCAAAAGGTTCTGGAAACACCGAGCGACACTAAACTGAAGTCCTCGTCCCTACCGCCCGGATTGGCTCTGAGTCAACTCGTTGGCCAATCACCTGTCAAACTGAGCATCGCTGACAATAATACCCCGAGATCGTCGATCAAAAGAAGCAAATCCAGTACACAAGAACAACCAATACGAGAGAGTTCACCGAAACTAAACGAAAGCCGACAACCGGAGGATAGGAACAAACACCGCTCTGACAAAAAACAAGCCGACGAACCTTGCCTCGAAAAGAAAACTAAGTCAGAGAAGAAAGTAGAGAACGAAGTCATACGGTCTTCCAAAAAAGAGGAATCGTTTTTCAGCAGACTGCTTCTGAGGAAAAGCGGAAAGAAATCTAAGAAGGATCAAACCGACGGCGACGGGCAGCAGGACGTGAAGAAAACGAAGACTGAGAAGACGACGGCGCAATACAAACCAGTGGACTCGGGCTGTTACTACGAACAAGGATACAAACACACAGACAGGTCCTATAAATCAGCCGGCCAGAAGGCGTTCGCTAATCAAATGCACAAGAGAATTGATAACAAAGGAGCCTACGTCCACGAGGGCGCGTACAAAGACGTCTACAGCGCTGCCAAAGTCACCGGCCTCGATTACAGCATAGATCTCAAAGCCGCCGAGGAAACCGATAAAATGTTGAACTACGAGATGAAAAACAGATTCAGCAGGAGGGACGAGTTCTGCGAGAAGATAGAAAAGAAACGTTCTTCGTCCAAAGAGGCGATCGATGACAAGAGGGAATCTTTCAATTTGCATCACGATAAATCGAAAAGACCCGCCTCCGTACAGAGATTGATCGAACCGTTCTCAACAAAAACACTTCTCGCCAACGAACTCGCTAACAGGACGGACGAAGACAGTCTGTCTCCTACATTCGAAGTTTCCGACGAAGATTCCATTAGAGGCCTGAGAATAAAAAACGTTAATAACGATTGCGTGTTCCACAGCGGCAGTATGCCCAAGAATATACCGTACTTCAATCCGAGCATCTCTGTCTCCGTGTCACCTGTGAAAAGCAAGACTCACAGCTCCGAAAACGTCAAGTATATGAGCCGTTCCACGGAAGCAGATTTCCACGGGATGGAAATAGTCAGCAAACGAGAAAACGTTCATCATGTTTGCAAAACTGATGAACCATACGTCGCTTCCGGTTTGAGATCTTTGGGTGACGAATACGTGGAAACTAGAAGCAGTATCGGTAAATCTCACAGTTTTCGTTACGCGTCCCAATCATCTTCGATCAGTTCCCAGGAAAACCAACTACCGAGTCTACCGGCCATCGTGGGCATCTCTGAACCGCTGCTTGAAAGCTGGGAAGTCAACTATAGGAGGAACGTTAGTGAGAGACACGACCACTCCAAGCGCATATCCGCCAGGAACTATAACATCCTGTCCGGTGAATCCGAGACAAACTACGACAGTCTACCGAGCACTGACAGCAGCTACCTAGACAGCCTCAAACAGGACATGAGCGAGGAAAGAAAGCTGTTTACAAACAGCATTCAAATCACAATAGATACTCACAAACGCGACAGCGATATCTCTCAGATCGAGGCTAAAATCGACGATATCATCAGCTCACCTAAGCCTATCATATCTCCAATACTCAAATCTAGTTCGTTAGATAGCGTTAAAAGTAGTCCCGAAAAACCTGCCGCAGACAGGCGCAAGACCATATCAGTAGAAAGTGCTATAGGACAGAGTAATAAAAATATTCAAAAAGAATTAAAGCAAGAGCCGGAGGGATTTATTTCAGTGACACACATAAATAGAGACAACGTGCCAGTTGTGATTAAGGCTGTTGAAATATCTAAGATAAATAAAAGCGACGAGAAGATACAAAAAACTGTAGCTAAGTCTGGAGTGCCGGAGTTCCTCAATATACAGCTGAACAAGGTTGACGCGAAACCAGTTACTAACGTTGTGTTGACAGCGAATGTGTCCCCAAAAAAGATTGACAGTCCGCAACCAGAGAAGGAACCGATCGTAGAAAATTTCGCCGCCCCTGATACGCAAATATCACAGAATATGAAAACCTCTAAGGAAATAACGAAAACGGTTGATGACACTGAGACTGTACAAGTTGAGGAGAAACCAAATGTGGTCGTGGCCAGAACACTGAGCACACCCGCGCCACAGAGTCCCATGACACCGAAGCATTTCTTTAAAAAGAAATTACTCAGCGTCGACTCACAGGAAAAAATAGAAAAACAAAGAACCAGTTCCGTCAGCACCGAAGGAAGCATCGAGAAGATCGATAACATATCCATGGACCAAAAGTCACATTCCAGTTTCGGCAGCAAAAGTTCCATTCAGAGTATCGACAGCGACGAGAACAAAATACAAGACAGACAGGAGGAAGCGGTCGTCTACAGAAGAAAACCCTTCGGCAAAGACTCCAAGAAACATGACGACGAACCAGAACTTATGAAGGTTTTCGCGAGGCGGTCCCTCAAACTCAAAGACTCGGAAGCAGACCACATAGCGCAGGACATAGCCGACACCAACAAAAACGATAATGTATCCAGGTCTAAAATACTCAAGAACGAGTTCAACGCTTCCATCAAATCTAGGGACAGCGACAAAGAGAACGAGGAACAGAAGGAACAAGCATTTGAGAATAAGCTGGTGGACATCGCGGCGAGGGTGAGTCAGTTCGGAAACTACCAGAGGAGCGTCAGCATCAGCAGTGTGACGCCCAAACGAGACAGCGCGCCCGCCTTCAGGAGCGAGGTGAACAAATACAAGAAAGAAGTATCCGACTCCACTCCCGAGAAGAGATTGAGGAACAGAACGTTCCCCGACTCATCCAACGACAGAGAGGACATCAAAAACGTTACCAAGAATGAAGCCATGGGATATAAAGCGGATACACTCACCAAAAGACCCTGGCAGAGGACCGAGTTCAGGCAGGTGGTGGAGAAAGAGAAACGAGACGTTACCGCGGTAGAAAAGGACGGGAACAATGAGAAAAGCGAAAGCGGGAAAGAGAAGGAAGAAGCGGACGCCTCACCACAGTTCAAAGGTATACTCCAAATGAGAGCGGAGTGGGAGAGACGAGCTCAAGGAATGACCAAATAA

Protein sequence:

>DPOGS207432-PA
MMLYVTSNSDVVRRPPRRRHISANIKRHHAGISQSSCSDGSLLSVGSSEMDDDSSSGHQLDQTRHEHSEVYTSSEPPSGVAPLSHSAARHKMAVRPRRNHALPRRKKNNSIAASALPITPELNEEMTRSTTPEVILKSSEVVTESFSSTTAKHHMIVKDQHQQLNRELQKVLETPSDTKLKSSSLPPGLALSQLVGQSPVKLSIADNNTPRSSIKRSKSSTQEQPIRESSPKLNESRQPEDRNKHRSDKKQADEPCLEKKTKSEKKVENEVIRSSKKEESFFSRLLLRKSGKKSKKDQTDGDGQQDVKKTKTEKTTAQYKPVDSGCYYEQGYKHTDRSYKSAGQKAFANQMHKRIDNKGAYVHEGAYKDVYSAAKVTGLDYSIDLKAAEETDKMLNYEMKNRFSRRDEFCEKIEKKRSSSKEAIDDKRESFNLHHDKSKRPASVQRLIEPFSTKTLLANELANRTDEDSLSPTFEVSDEDSIRGLRIKNVNNDCVFHSGSMPKNIPYFNPSISVSVSPVKSKTHSSENVKYMSRSTEADFHGMEIVSKRENVHHVCKTDEPYVASGLRSLGDEYVETRSSIGKSHSFRYASQSSSISSQENQLPSLPAIVGISEPLLESWEVNYRRNVSERHDHSKRISARNYNILSGESETNYDSLPSTDSSYLDSLKQDMSEERKLFTNSIQITIDTHKRDSDISQIEAKIDDIISSPKPIISPILKSSSLDSVKSSPEKPAADRRKTISVESAIGQSNKNIQKELKQEPEGFISVTHINRDNVPVVIKAVEISKINKSDEKIQKTVAKSGVPEFLNIQLNKVDAKPVTNVVLTANVSPKKIDSPQPEKEPIVENFAAPDTQISQNMKTSKEITKTVDDTETVQVEEKPNVVVARTLSTPAPQSPMTPKHFFKKKLLSVDSQEKIEKQRTSSVSTEGSIEKIDNISMDQKSHSSFGSKSSIQSIDSDENKIQDRQEEAVVYRRKPFGKDSKKHDDEPELMKVFARRSLKLKDSEADHIAQDIADTNKNDNVSRSKILKNEFNASIKSRDSDKENEEQKEQAFENKLVDIAARVSQFGNYQRSVSISSVTPKRDSAPAFRSEVNKYKKEVSDSTPEKRLRNRTFPDSSNDREDIKNVTKNEAMGYKADTLTKRPWQRTEFRQVVEKEKRDVTAVEKDGNNEKSESGKEKEEADASPQFKGILQMRAEWERRAQGMTK-