Monarch geneset OGS2.0

DPOGS206291
Transcript: DPOGS206291-TA, 3345 bp
Protein: DPOGS206291-PA, 1114 aa
Genomic position: DPSCF300290 + 320994-330297
RNAseq coverage: 383x (Rank: top 31%)
Annotation
Heliconius       HMEL013122         0.0      71.73%
Bombyx           BGIBMGA010803-TA   0.0      63.45%
Drosophila       CG6511-PA          5e-173   43.41%
EBI UniRef50     UniRef50_F4WEA1    0.0      40.22%   Erythroid differentiation-related factor 1 n=5 Tax=Myrmicinae RepID=F4WEA1_ACREC
NCBI RefSeq      XP_001607933.1     0.0      39.69%   PREDICTED: similar to conserved hypothetical protein [Nasonia vitripennis]
NCBI nr blastp   gi|383851762       0.0      41.07%   PREDICTED: erythroid differentiation-related factor 1-like [Megachile rotundata]
NCBI nr blastx   gi|322789850       0.0      40.66%   hypothetical protein SINV_10770 [Solenopsis invicta]
Group
KEGG pathway 
Orthology group: MCL13014 (Single-copy universal gene)
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206291-TA
ATGGATGACGTCGAGAATGAAAATGAGTTCAATCGAAACAAGGGGCGAAGTCCATCTCCAGGTGTTAAATCAACAGCGGTTGTAAAATATACAGCATTCCAGACTCCTGCTAGTTATGCCAGGTTACAGTGCAACACAGATATCAACCTCCCTCCATCCAACTGGGGTGGGATAGACACTTACGGCTTGAAGCAGATACTCACAAGAGATTCGGGACTTTCAAGCTTCAGAATGGCTCACATGTTCCCCGACTGTGTCGGAGAGGTGGACGTTATATCTGATGCAGATTGCATAAAGAATCTCCTCAAGCTACCCTACCAACCTAATGGAACTGTTAGTATGATGGTGCATAGAGTTGAAAATACATTGCTATTGGACGACTTTGATGTCTACGAATACCTGATGAAGTCGGAGTGGTCCTGGTTAAAGGATTTCTTCTACGAGAATGTACTGAAGACTATGTCGGAAAAGGATCGTATCTCTCTGACGTCATCGGCTAGTAGGAGTGCCCTCCAGCTGACGCACAAGTTCCTATCCCACAGTGTGGTGGCGCCACCTCTGCCAGCCAGCCAGCCCTGCCAGCCGATATGTTTACCAGGACCATTTCTCCCGGAGCCGGAGACTCGTCCGGAAGAACCAGCTAAAGAACAGAGTTTCAATAGGAACGTGCTCTGGACATTCGAAGACATACACATGTTGATAGGTAGCAACCTACCTATTTTCGGCGACAAGGACAGGCCTTGCGTCAGTTTGCGTTTACGAGACGCGAGGGAACCGATAAACGTACTCACGGGCATCGACTACTGGCTGGACAACCTCATGTGCAACGTCCCTGAGGTTCTGATGTGCTACCACTTGGACGGTATCGTGCAGAAATACGAGCCAATGAAGACAGAAGATTTGCCGCACATGGAGAACTCCAAGTTCTCGCCAAAGGTTATAAGGAATGTCGCACAGAATATTCTATCGTTTTTAAAATCTAACGCGACTAAGGCCGGTCACACATACTGGTTATTCAAAGGCCCCCACGACGATGTTGTCAAGTTGTACGACCTGACGACTCTTTGCCCCGATGACATGGACAATCCCTTCACGACGCCCGTGGCGATGCTATTGTACAGGGTGGCCAGGAACATGAGGATGATGAACAGGTCCAAACATGTCCGGCAGCTGCTGGAACACGTCGTGGAATTGCTCGGAAGCGAGAGATACCCTCAGATTGTAGCGTCCTCGCATTATATGCTGGCTGATCTGTATGTACCCGCCACCACAAACCCGGCACATCCAGATTTCAAAGACGAAAGCTCGGACTCCGAAGAGGAAGCTGAGTTTGGTAACTACGCGGAGTGCCCCTCGGCCGACAGGGGCAGACGGACAGACAAGGACGACGAGATTGTACGAGACGTCACTAATGACGATAAGTGCGAGGGAGATGGAAATATAAACCGGGAAGAGTGCGAGCGAGACGGCGATAGTGCTGGCGAACTCACTCTCCGAGTACGGGGTCTGGCGTTAAGAGACATCGGCGATAGGCAGACACACGATACTACGAAGAAAACTAAGAGATCCACTACAGGACTGGGCATCGAACCCGCTACCAGATGTGGGCGGGCGCTGAAACACGCGCTCACCGGACTCAAGGCTCTACATCATCTGACCATAGATAAATCTATGGAAGAGGAAAGAGAACGTCTGAGACAGCAGAAGATCAAAGAGGAACAACATCCGAAAATGGCCAATCCTTACGAACCCATCAGAATGGGCTACAAGACGTCCAAGCTAAAAGATAAAGAGCACACCTCGAGGAGCAGGCGTCGGCGGACGAGGCGGAACTCGTCTAACCACATAGAGACCAACTCGAACGTAGACAAAAACGCTATTTTAGTGCGGAAAGAGAACACGATAACCCTGCAAGAACCGAACCGGGACGACAACTTCGCCTGGAAACTACATCTGAAGACGCTGCTGTATGAAAAAATATGCCTCGCGTACGCCACGCT
AGCCGAATACAGCTACTCACACGAGCAGTACGGCTTCTCCTTGAAGTATATAGATCTGGCCAGCAAATGCCAGAAGCTGTTGAGCAATATGATCATCAAGAGTCGCGTGGTGGACGCCAGCTGTCTCATAGGCAGGGTCGGCGACAACTACTTCCAACTGAGCAAACACTGGCCCAGCTTGGACCAGTACAGCAAACAGTTCGGCATGGACCACGAGATCGATAGGGAGATAAGAAACGAGATAGAGAGCGATATGGCGGAAGAAATGGAGGGCTTCGGGGGAGATGAGTTTGAACTGGAGATATACATGTCGTCGTTGGACACTTCGGATACGTTGCCGGAGGAGTTTCGTCATCTGTCCAAAAAAGCTGCCGAATATTTGGACGAGGCTACTGAAATATTCCAACACGTGAACGACGTCCCCAACCTGGCCTTGTTGTACTGCAATAAGGCCAGATACATGAGGTTCAAAGTTCACTGCGACAAAGGAGTTTTCGATGATGAGAAGCGTCGGACTTATAATTCAGCCGAAGAGCTGTACTCCCAGGCGCTCAGGCTGGTGGGATCTCGGGAGGCAGCTATCAAAGACCTGGTATGCTGGGAATTGTCCTGCCACCTGTACACTAGGGCTGTACTCTTGCAGGACCATCCGGAAATCTATGCTAGCGAAGTTACAGAAGTAGCGGAGGCTTTCAAACACGCTCTGAAGCACTGCCTGTTGAGTCCGGGCCCGAGACAGTACTTGTATCAATTCAGAGCCGCTATGATATATCACCGGCTGGGATCGCTGTACCATTCACAGTACAGGAAGAGCCAAGACCCCTCCATCCGCCGGCGCATGTTATCCGCGACGTGTTCCCACTATGAGAAGGCGGCTCTCCAATTCGCCTCCCTCGAGGACCCCGCCATGTTCCTCACAGCACGACTCGAGCACATTGCAGCATTAGAGGCACACGCCGCAGTGTCGCCGAATCTGAAGTTGAAGTCGCTTCAAAACGCAATAGACTTACTTCGTCAGTGTCACTCAATAATGAAGCTGTTAAAAGATAGAGATCCGGACGAAAAGAAAGAGAAAGATAAACCAGAAGACGGCGACGAGAAAAGTCTGAAAAACGAACATAGTTTACTGAGCTTATACGAGAATAGGCTTCACTATATTTTGAAAAGTATCATACAATACTGCAGATCGAAGTCCAACAAAGACTATGACAAGATGACAGAGATGTACAAGAAGCTGTACAGCGCGTCCCTGAAGATAAGGAGAGACGAGGACGTGCGGCTGTACGCGGCCAGTGTGTGCGACGTGCTCGCGGCCATGGACAGCATCATAAGCGAGTTCCAGTAG

Protein sequence:

>DPOGS206291-PA
MDDVENENEFNRNKGRSPSPGVKSTAVVKYTAFQTPASYARLQCNTDINLPPSNWGGIDTYGLKQILTRDSGLSSFRMAHMFPDCVGEVDVISDADCIKNLLKLPYQPNGTVSMMVHRVENTLLLDDFDVYEYLMKSEWSWLKDFFYENVLKTMSEKDRISLTSSASRSALQLTHKFLSHSVVAPPLPASQPCQPICLPGPFLPEPETRPEEPAKEQSFNRNVLWTFEDIHMLIGSNLPIFGDKDRPCVSLRLRDAREPINVLTGIDYWLDNLMCNVPEVLMCYHLDGIVQKYEPMKTEDLPHMENSKFSPKVIRNVAQNILSFLKSNATKAGHTYWLFKGPHDDVVKLYDLTTLCPDDMDNPFTTPVAMLLYRVARNMRMMNRSKHVRQLLEHVVELLGSERYPQIVASSHYMLADLYVPATTNPAHPDFKDESSDSEEEAEFGNYAECPSADRGRRTDKDDEIVRDVTNDDKCEGDGNINREECERDGDSAGELTLRVRGLALRDIGDRQTHDTTKKTKRSTTGLGIEPATRCGRALKHALTGLKALHHLTIDKSMEEERERLRQQKIKEEQHPKMANPYEPIRMGYKTSKLKDKEHTSRSRRRRTRRNSSNHIETNSNVDKNAILVRKENTITLQEPNRDDNFAWKLHLKTLLYEKICLAYATLAEYSYSHEQYGFSLKYIDLASKCQKLLSNMIIKSRVVDASCLIGRVGDNYFQLSKHWPSLDQYSKQFGMDHEIDREIRNEIESDMAEEMEGFGGDEFELEIYMSSLDTSDTLPEEFRHLSKKAAEYLDEATEIFQHVNDVPNLALLYCNKARYMRFKVHCDKGVFDDEKRRTYNSAEELYSQALRLVGSREAAIKDLVCWELSCHLYTRAVLLQDHPEIYASEVTEVAEAFKHALKHCLLSPGPRQYLYQFRAAMIYHRLGSLYHSQYRKSQDPSIRRRMLSATCSHYEKAALQFASLEDPAMFLTARLEHIAALEAHAAVSPNLKLKSLQNAIDLLRQCHSIMKLLKDRDPDEKKEKDKPEDGDEKSLKNEHSLLSLYENRLHYILKSIIQYCRSKSNKDYDKMTEMYKKLYSASLKIRRDEDVRLYAASVCDVLAAMDSIISEFQ-
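
The transcript and protein lengths reported in this record are consistent with a complete coding sequence: 1114 residues plus one stop codon gives 3 × (1114 + 1) = 3345 nt. A minimal sketch of how that check could be run on FASTA text is shown below. The function names `parse_fasta` and `cds_matches_protein` are hypothetical helpers written for illustration, not part of any database tooling, and the toy records stand in for the real sequences. The sketch assumes the transcript is coding sequence only (no UTRs) and that a trailing `-` or `*` in the protein marks the stop codon.

```python
def parse_fasta(text):
    """Return {record_id: sequence} from FASTA text, joining wrapped lines."""
    records = {}
    name = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # Use the first whitespace-delimited token of the header as the ID.
            name = line[1:].split()[0]
            records[name] = ""
        elif name is not None:
            records[name] += line
    return records


def cds_matches_protein(cds, protein):
    """True if the CDS has 3 nt per residue plus one stop codon."""
    residues = protein.rstrip("-*")  # drop a trailing stop marker, if any
    return len(cds) == 3 * (len(residues) + 1)


# Toy records for illustration only (not the sequences from this entry).
example = """>toy-TA
ATGGAT
GACTAG
>toy-PA
MDD-
"""

recs = parse_fasta(example)
print(cds_matches_protein(recs["toy-TA"], recs["toy-PA"]))
```

To apply the same check to this entry, the two FASTA records above would be passed in under the IDs `DPOGS206291-TA` and `DPOGS206291-PA`.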