Monarch geneset OGS2.0

DPOGS203222
Transcript        DPOGS203222-TA  1782 bp
Protein           DPOGS203222-PA  593 aa
Genomic position  DPSCF300035 + 973152-976509
RNAseq coverage   214x (Rank: top 45%)
Annotation
Heliconius      HMEL006504        0.0     87.25%
Bombyx          BGIBMGA011512-TA  5e-126  69.46%
Drosophila      CG7218-PA         4e-162  50.60%
EBI UniRef50    UniRef50_E0VJ71   0.0     56.32%  Putative uncharacterized protein n=2 Tax=Neoptera RepID=E0VJ71_PEDHC
NCBI RefSeq     XP_971007.1       0.0     59.09%  PREDICTED: similar to AGAP002775-PA [Tribolium castaneum]
NCBI nr blastp  gi|91076338       0.0     59.09%  PREDICTED: similar to AGAP002775-PA [Tribolium castaneum]
NCBI nr blastx  gi|91076338       0.0     59.09%  PREDICTED: similar to AGAP002775-PA [Tribolium castaneum]
Group
KEGG pathway 
InterPro domain  [48-447]  IPR008010  7.5e-223  Membrane protein, Tapt1/CMV receptor
Orthology group  MCL14040  Single-copy universal gene

Nucleotide sequence:

>DPOGS203222-TA
ATGACACTGAAAAATGACGAGTTGCAAGTACACAATAAAAGGTTACGTTTTAAGAACGTAACCAATATACAACATTGCTCCGGTGATATCCCTGATAGAAGTGGAAAGGGGCCAAAAATCAAAGATGGTGATAAAAGCGCTTCTCTTGTTACATTCTTGCACGTTGAACTTACTCGTGGCTATTTACTTGAGCACGATGAAGAAAGGTTTTCGGCTCGAAGAGAAAAAGTTTACTCTTTCATCAAAATACCACAGGAACTGGAAAAATTTATGGCATATGGATTTTTCCAATGTGCAGATTCTCTGTTATTTGTTTATACTTTTTTACCCTTAAGATTTGTTATGGCATTCTGGTCATTTTTTACCAGACTCTTTAGACAGTGCTTTGGGTTTAATTCACAAAAAAAGCAAAGTATATTGAAGCCGGCCGAGACATGTGATGTACTTAAAGGTTCTATATTATTAGTATGTAGTATATTAATGTGCTATATTGATACAAACATGATGTATCATCTAGTGAAAAGTCAGAGTGTTATGAAGCTGTATATATTTTATAATATGTTGGAAGTCGGAGACAGATTGTTTTCCGCTTTCGGTCAAGACACAATTGATGCTCTCTTTTGGACAGCCACAGAACCTCGGGACAGAAAAAGGGAACACCTCGGACTTATTCCTCATTTAATATTTGCTATCATATATGTCTTTCTTCACAGTTTATTAGTACTGTTCCAAGCGACAACACTCAATGTGGCTTTTAATTCCAATAATAAAAGTCTTCTTATCATTATGATGTCAAACAATTTTGTGGAGTTAAAAGGCAGTGTATTTAAGAAGTTTGACAAGAATAATCTCTTCCAAGTGTCATGCAGTGATGTGAGGGAGCGTCTTCATTTGTCAGTGTTGCTTTTTATAGTAGTCTTACAAACTATGAAGGAGTATATGTGGAAAGAAGAGAGATTCTGGATATTGGCACCCGATTGCGTACTCGTTTTAACATTTGAAGTTATCATTGACTGGGTGAAACATGCTTTTATAACCAGGTTTAATGAGATACCGTATGGCGTGTACCGTGAGTACACTGTGAGTTTGGCTTATGACGTTGCTCAAACCCGTCAGAAGTACGCATTCAGTGATCATTCAGATCTCGTTGCAAGACGTATGGGCTTCATACCACTACCACTTGGAGTTGTTATAACAAGAGTTTTAGTACATGCCGTTAAAATTGATGGCCTGGCAGCAATACTTTTGATATTCATAGCATATCTATGTCTAATATCAATAAGAATTTTAATATCCATTGTGATACTTGGCAAGGCCTGTGATCTGATCACACAACATCAAAATGACAAAAGTGATAGCCATCATGCAACACCTAAAAAAGAACAAAAAGATTTCAAGGATGCATCGCATAAAACAGGTGAATGTGAAATGAGTTCTGAGAAGAAATCTATGCAGTTGAAAATTGTTATGCCGGAAAATGTTATGATCGAGCCTCTTGTTGATGCTTCGGTCGGAGCAGCAGCTATCTTCTCAAACAGTGCTATAGATTTGAACGGAGTCTGTTATCTTAATGATAAAATGAATGTCCAAGTTAAGCAGGAACCTGAAGATTTCATTGAACCTGAATTGGACGTTAGCCGCAGTGCTCCAGACATAAAAGCAGCTGCGGCCGCCATAGAGGAATCAGTTGAGAGGCCGCAGTCGCCGGACGCGGACGGGCTCAAGCGACGCGCTGAGTCCGAACCGAATCTAGCAAAAAACGACGAACAGGAGAACACGTAA

Protein sequence:

>DPOGS203222-PA
MTLKNDELQVHNKRLRFKNVTNIQHCSGDIPDRSGKGPKIKDGDKSASLVTFLHVELTRGYLLEHDEERFSARREKVYSFIKIPQELEKFMAYGFFQCADSLLFVYTFLPLRFVMAFWSFFTRLFRQCFGFNSQKKQSILKPAETCDVLKGSILLVCSILMCYIDTNMMYHLVKSQSVMKLYIFYNMLEVGDRLFSAFGQDTIDALFWTATEPRDRKREHLGLIPHLIFAIIYVFLHSLLVLFQATTLNVAFNSNNKSLLIIMMSNNFVELKGSVFKKFDKNNLFQVSCSDVRERLHLSVLLFIVVLQTMKEYMWKEERFWILAPDCVLVLTFEVIIDWVKHAFITRFNEIPYGVYREYTVSLAYDVAQTRQKYAFSDHSDLVARRMGFIPLPLGVVITRVLVHAVKIDGLAAILLIFIAYLCLISIRILISIVILGKACDLITQHQNDKSDSHHATPKKEQKDFKDASHKTGECEMSSEKKSMQLKIVMPENVMIEPLVDASVGAAAIFSNSAIDLNGVCYLNDKMNVQVKQEPEDFIEPELDVSRSAPDIKAAAAAIEESVERPQSPDADGLKRRAESEPNLAKNDEQENT-
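
The lengths reported in this record appear mutually consistent: 1782 bp is 594 codons, which is 593 amino acids plus one stop codon (shown as the trailing "-" in the protein sequence). A minimal Python sketch of that check follows. The function names are hypothetical and not part of the record, and the sketch assumes the transcript length covers only the coding sequence, stop codon included.

```python
# Hypothetical helpers for sanity-checking a gene record; not part of OGS2.0.

STOP_CODONS = {"TAA", "TAG", "TGA"}  # standard genetic code stop codons


def cds_matches_protein(cds_bp: int, protein_aa: int) -> bool:
    """True if a CDS of cds_bp bases encodes protein_aa residues plus one stop codon."""
    return cds_bp % 3 == 0 and cds_bp // 3 == protein_aa + 1


def has_start_and_stop(cds: str) -> bool:
    """True if the sequence begins with ATG and ends with a standard stop codon."""
    cds = cds.upper()
    return len(cds) >= 6 and cds.startswith("ATG") and cds[-3:] in STOP_CODONS


# Lengths taken from the record above: 1782 bp transcript, 593 aa protein.
print(cds_matches_protein(1782, 593))
```

To check the start and stop codons, pass the full nucleotide sequence of DPOGS203222-TA (without the FASTA header line) to has_start_and_stop.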