Monarch geneset OGS2.0

DPOGS208748
TranscriptDPOGS208748-TA4455 bp
ProteinDPOGS208748-PA1484 aa
Genomic positionDPSCF300043 + 492015-497628
RNAseq coverage833x (Rank: top 15%)
Annotation
HeliconiusHMEL0152226e-13360.35% 
BombyxBGIBMGA003411-TA1e-3061.54% 
DrosophilaCG32249-PB2e-0725.81% 
EBI UniRef50UniRef50_D6WMQ52e-4028.28%Serine protease P146 n=2 Tax=Tribolium castaneum RepID=D6WMQ5_TRICA
NCBI RefSeqXP_001950908.12e-4530.86%PREDICTED: similar to conserved hypothetical protein [Acyrthosiphon pisum]
NCBI nr blastpgi|910819157e-4028.28%PREDICTED: similar to corin [Tribolium castaneum]
NCBI nr blastxgi|1571048755e-9025.66%hypothetical protein AaeL_AAEL014367 [Aedes aegypti]
Group
KEGG pathwaypca:Pcar_20392e-12 
 K08300 (rne)maps-> RNA degradation
InterPro domain[153-251] IPR0000827.9e-11SEA
Orthology group 
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208748-TA
ATGGATCAACCGCCGAGGGAGATAGAAAAGCTGCTTCATAAAACCGAAGGTTTAATGTCAAAGGATGTGGAGAAAGGAGAGAAGTTATACGTGACTGCAGACAAGCGCGGAGAGAAAGGAGGCTGCAAACGACCGCTCTGCTGGACGCTCCTTGGGCTGGTTGTTGCCGCTATCGTGGCTCTCATTGTTTTAGCAGCAACTGGAATACTATTTTCAAATTCTCCGACCGCTCTCGAACCTTACAACTCATCTGTGAGTTCAGCGCGCGCTTTCAGCGGCATCACACACGTCCACAGTCACGATCATGACCACAGTCATGACCACAACCACGACCACGACCACGACGACCACGGTCGCAACGAACAACCGACCACTCCACCCTTTGGAACGCAGAGTGACGAAGCTCAGGAACCTTCTATATCTGGAGAGAGCGGAGATATGTCCATATATGTGCCAAAAACGGTAGAAGGAGAACTGAGAATCGATAACGAAGTGTTCACGCCTGCTTTAGAGGATCCCGAGAGCGACGAGTACAGGGACTTTAAAGCCAACTTTGGTGACGCGCTCAAACACGCACTGTTCAACAGGAACAGTCTAGAGAACGGTGACAACGAGATAATGGTTGAAGTTGTTCAGATAAGAAACGGCTCGATTATAGTTACATACAGAATCCATTGGATTCCAAAACACAATTCCGAACAGAAGACTGAAGAACTCTTGACACCGAACATTTTGCAAACAAAACTGGACAATTATCTAAATGATAACCACAGAATGATAAGCATCTACCATGTCGCTGAAGGAAACCTATCTACGAGACAGGTCCTAGACTTGTGTAAAATAAACAACTATGACTGCGAATACAAATGTGAATTCGACGAAGCCACCTTAGAGTTCAATTGTATTTGTCCGCCTGGTCAAATAATAGATGTCAATTCACCCAAAAAGTGCGTGACGCTACTATATGACCCTGAAAGACGAAATAAAGCGGAAACAACAACAGCGAAAATGACCTCTCTTGAACCGAATGTTAATATAATGGAATCTTCGGCTGAAGACGAAAAATCGGAATCCGAACACGATATGGAAAATACATATGATTGGAAAGATGATCGCCAAACAATGCCGCAAACGACTCCTGAAGCGGAAACTGATTTGAACTTTTCTCATCTGTTCGGTCAAACTAATTCAGAAACACCAAAACCAGAACCTGAACCGTCGTCAGAAGAAGTGAAACCAGAGCCTGAACCTGAGCCCGTGCAATCAAAGCCTTCTCCTGAATCAATACCAGAAAGCAAACCGCAATCTGAACCGCTACCTGAACCTGAACCCACATCTGAGCCGGAACCTGAACCTCAACCTACATCAGAACCAAAACCTGAACCAGAACCTGAACCCGCGTCAGAACCAAGACCTGAGCCAGAACCTCAACCCGAACCAGAGCCTGAACCCTCATCAGAACCGAAACCAGAACCTGAACCTCAACCCGAACCAAAGCCTGAACCAGAACCAGAACCGACATCAGAACCACAACCTGAACCTCCATCTCAAAAAGACCTTGAATCGACAGTAGTACCAGAACCTAATCCAATCACAGAAATTAACGCCCAATCAGAACCTAAGCCTACAATGATGTATAATTCCGAACCAAGATCAGCTCCAAATTATGGTGTAGAACCAGAAGTTACAACAGACAGCATGCAAACTATGAGAAGTATAGATGAAATCTATAACTCTCATGAAGCTACTACTATGGCGATGCCCGAGGCAGGTAGAACACTAGCACCTCTTACAAGTGATGAAATCTCTTCAATTCTTAAACAGAATGAAGAATCTGCTTCTATGACCGAAAATAATCAACCAATTTCAGACCATGCAACGTTTGATTTAGACAGTATCATTGGACTTAATTCTAATGAGTCAAGTCAGACATCGAACGACGATGCTGCAACTGAAACTAATTCTGAATTTACTCCAGTAACAATAAACAGTAATAAAAACAATATGGTTGAGAGCATTCCAATGACCACAACCATGAAAACAATTTCAGACGCAAAAATGATTGATGAAGATCCAAATGGATCAAAAATGAATATAGCTTCAGGGTCAATCGAAGCGGAACCGTTAAGTACTACTGAATTTAATATTAATAATTATACAGTCAGCGACAATGAATGGCTAGAATATGACGACAATAATAAGTCAACATCCTCAATTGATATGAATTCTGACATTAACAAAGAAATGGAATCACAAATGCAAACTGAAATAAATAACGAACAAATCACTAAAACTGAAACGAGATCATCTAAAACCTTTGGGGATTTCCTGTATGAAACCACAACCGAGAAATTGAGCGAAATGTACAATGATGATATAACATTTGATCTCATAAAGAAAAATGTTGAAATGCCCGTAATGGAAACAACTCCTTTCACGGATACGGAAAATCAAGAACCTACAGAAAATACGCTTGATATCATTGGTATAAACGCCGAAGACATGACAAGCAAAAGCAGTGTAGAAACAACAACATCTCAGAGTACTGCTAAAAGTGAAACTTACAAACCAACAACAGAAGTCGAGCCTGTAATCCTTAAGGAAATGGGATTTGGTCTAACTACTGAATCGAATGTAAAAACAAGTCCTGATAATAAAATTATTGTAGATTCGCCGACACAGGAGAAAGAATTATTCAACACAAACACGGAACCCATCGAAACAACAACAGCTAAAATGGAAATAAGTCCAATTTTTTCATTAAATGAACAAAATAAAAACACCAAAGAAGATAAAGAATTAATAAAAGCTTTAGAAATAACAACAGTTTCAACTGATGAAGCTGAAGAAACAACTACAAAAAGAAATACATCAGATGAAAGCACCGATGTAACTTTTGACACTATTAGTCTGCTTTACAACCGTTCTTCGAAATCCATGGACGATAAAGAAAACACTGCCACTGAAGTGACTATTTCAAATGACATTCAAAGCTCCTCCGGAGAAAGTTCTACGGATTCTGATTGGCTATCCGAATCCGTAACAGAAATGAACTACGAAGAGCTTGTTAATAAAATGAACTCACCCGTTACAACAGAAATGCCACTTATTAAAGTAGATGAAAACATGAGCCACGGTGTCAACAAGGACGACTTTGAACCGGATTATTTAAATATCGGCTCTAAATCCAATAAAATGACTGACCACGAGGAGCCTTTATATGGAATGATAACTGACTACGATTCTGAAGACTCCAGAGTAAAAAGAGTAAACATGGCTGGCAAAGAGAATAAAACTACTTCCATAGAAGAAACTATGACATTAAATTCCTCTCAAAATACTGTATTCGAAACGACAACAGTAAGTCAATACATTTACAAAGCAGGTGGAAGCAGTTCCAAAGAAGAAGATATCCAGCCAGTTGTATCTTCTCCAGCTCCCGTTTGGGAAGAAACAGAAGTAGAAAACCATACAACAAGTTCACCATCATCGTCTCAAGAAACAAACGAAAGTAACCAACAAAATATTAACGCAAATGTCACATCAACAACAACATCCGTACCATTTGCAACAACACCTTCAAGTAATGATCAGAATAATTCTGGAAGCGAAGCAAACCAAAATGAAATGAAGGATTCGATTGCTGAAAATACATCTCTGGTTAATAACTTAAACGTAACGATATACGAAATATCAAATACTACTGACAATCAGTCGTTTGTATCCACTAATCCTTCCAATCCATTACAACCCGAAATCATATCTCACATTGACCACGAAACAGATATGAATCCATTCTTACCTGAGGCGGAAAACAATAAAATTCTCGTTAAAAAACTCCAAGAAGGTCATGACATAGAACCAACAAATGTAAACGAAACACAGAACGAAAGTGTTGATGATCATAACACGAACACGTCAAATATCGCTGCTATGGAACCGAAGCAATCTGTAGAAGAAACTGCACAAATGAATAACCTTAATGTTATTGGTGTCGCTGGCGTCAAACCAGAAACAACAACAGCTGCAAGTTCAGAAGACGATCTTATATTTAATCATTTGTACACAAACAGTATATCTCGCGACGAAACGTTCACAACAACAGAAATAAATAAACCAGTTGATGACGTACCAACTTTAGAAGACGATAAAGATGTTTTGCCTATTTCAACATTCCTGCTAGACACGGACGATTTGGACGTAACAAAAACACCACCATCTTCATCGGAAAACGAATTAAAAAGTAGGGCTAGCATTGATGTCACCAGCTCAAAGTCCAATGAAGATGTTAACCAATTTCTAAACGTCGTACCCATAGAAGCTGAAAAAAGTAACGAAGCACTGAACAAAAACTTAAATTCTGAGAGTATCCAAGAACTAAACGACATCAGTGACTCACCAAAGAAGAGCGACAGGACGATAGACGTGAACAATTTGGAAGCTAGATACTTTTAA

Protein sequence:

>DPOGS208748-PA
MDQPPREIEKLLHKTEGLMSKDVEKGEKLYVTADKRGEKGGCKRPLCWTLLGLVVAAIVALIVLAATGILFSNSPTALEPYNSSVSSARAFSGITHVHSHDHDHSHDHNHDHDHDDHGRNEQPTTPPFGTQSDEAQEPSISGESGDMSIYVPKTVEGELRIDNEVFTPALEDPESDEYRDFKANFGDALKHALFNRNSLENGDNEIMVEVVQIRNGSIIVTYRIHWIPKHNSEQKTEELLTPNILQTKLDNYLNDNHRMISIYHVAEGNLSTRQVLDLCKINNYDCEYKCEFDEATLEFNCICPPGQIIDVNSPKKCVTLLYDPERRNKAETTTAKMTSLEPNVNIMESSAEDEKSESEHDMENTYDWKDDRQTMPQTTPEAETDLNFSHLFGQTNSETPKPEPEPSSEEVKPEPEPEPVQSKPSPESIPESKPQSEPLPEPEPTSEPEPEPQPTSEPKPEPEPEPASEPRPEPEPQPEPEPEPSSEPKPEPEPQPEPKPEPEPEPTSEPQPEPPSQKDLESTVVPEPNPITEINAQSEPKPTMMYNSEPRSAPNYGVEPEVTTDSMQTMRSIDEIYNSHEATTMAMPEAGRTLAPLTSDEISSILKQNEESASMTENNQPISDHATFDLDSIIGLNSNESSQTSNDDAATETNSEFTPVTINSNKNNMVESIPMTTTMKTISDAKMIDEDPNGSKMNIASGSIEAEPLSTTEFNINNYTVSDNEWLEYDDNNKSTSSIDMNSDINKEMESQMQTEINNEQITKTETRSSKTFGDFLYETTTEKLSEMYNDDITFDLIKKNVEMPVMETTPFTDTENQEPTENTLDIIGINAEDMTSKSSVETTTSQSTAKSETYKPTTEVEPVILKEMGFGLTTESNVKTSPDNKIIVDSPTQEKELFNTNTEPIETTTAKMEISPIFSLNEQNKNTKEDKELIKALEITTVSTDEAEETTTKRNTSDESTDVTFDTISLLYNRSSKSMDDKENTATEVTISNDIQSSSGESSTDSDWLSESVTEMNYEELVNKMNSPVTTEMPLIKVDENMSHGVNKDDFEPDYLNIGSKSNKMTDHEEPLYGMITDYDSEDSRVKRVNMAGKENKTTSIEETMTLNSSQNTVFETTTVSQYIYKAGGSSSKEEDIQPVVSSPAPVWEETEVENHTTSSPSSSQETNESNQQNINANVTSTTTSVPFATTPSSNDQNNSGSEANQNEMKDSIAENTSLVNNLNVTIYEISNTTDNQSFVSTNPSNPLQPEIISHIDHETDMNPFLPEAENNKILVKKLQEGHDIEPTNVNETQNESVDDHNTNTSNIAAMEPKQSVEETAQMNNLNVIGVAGVKPETTTAASSEDDLIFNHLYTNSISRDETFTTTEINKPVDDVPTLEDDKDVLPISTFLLDTDDLDVTKTPPSSSENELKSRASIDVTSSKSNEDVNQFLNVVPIEAEKSNEALNKNLNSESIQELNDISDSPKKSDRTIDVNNLEARYF-