Monarch geneset OGS2.0

DPOGS200451
TranscriptDPOGS200451-TA1914 bp
ProteinDPOGS200451-PA637 aa
Genomic positionDPSCF300260 - 243067-255197
RNAseq coverage232x (Rank: top 44%)
Annotation
HeliconiusHMEL0044680.066.05% 
BombyxBGIBMGA011231-TA0.065.06% 
DrosophilaCG10069-PB1e-13748.56% 
EBI UniRef50UniRef50_Q7Q5I81e-13444.14%AGAP006413-PA n=19 Tax=Endopterygota RepID=Q7Q5I8_ANOGA
NCBI RefSeqXP_968141.29e-14951.48%PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
NCBI nr blastpgi|1892383512e-14751.48%PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
NCBI nr blastxgi|1571339642e-14450.00%hypothetical protein AaeL_AAEL003064 [Aedes aegypti]
Group
Gene OntologyGO:00550852e-29transmembrane transport
GO:00160212e-29integral to membrane
KEGG pathway 
InterPro domain[71-627] IPR0161962e-57Major facilitator superfamily domain, general substrate transporter
[161-560] IPR0117012e-29Major facilitator superfamily
Orthology groupMCL12884 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200451-TA
ATGAACGGTGTCAGCCGAGACGCACCTGTTGGTATACAATGCTTACAAAAATTCAGTTCCTGGATGTGTCCGCGGTTACAGATAAATAGATTGCGATGGTATCAAGCCAGTGTTCTGGTGTTGACGTACTTCACCTATATGACGTACCACCTGACGAGGAAGCCGATATCCGTGGTGAAGAGCATTCTCCATCAGAACTGCAGCGGCTTGACGCCACCCCCGGACATCAGACCCGGGGACGACCAATGGTGTAATTGGGCGCCGTTTAGTAAGTATCAAGCCAGTGTTCTGGTGTTGACGTACTTCACCTATATGACGTACCACCTGACGAGGAAGCCGATATCCGTGGTGAAGAGCATTCTCCATCAGAACTGCAGCGGCTTGACGCCACCCCCGGACATCAGACCCGGGGACGACCAATGGTGTAATTGGGCGCCGTTTAACACACATGATGCTAATACGCTATTGGGTACACTGGATTCAGCTTTCTTGTTCTGCTACGCCGGAGCCATGTTTATATCTGACACACATGATGCTAATACGCTATTGGGTACACTGGATTCAGCTTTCTTGTTCTGCTACGCCGGAGCCATGTTTATATCTGGGATGATAGCTGAGAGAGTTGATCTAAGATACTTCCTGTCTCTCGGCATGTTAGTGTCCGCTGTATTCTGCTACTTGTTCGGACTAGGTCGGACCTTGGATATACATGATATATCCTTCTATCTAATTGTGCAGGCAGGTGCTGGAATAGCTCAGACAACAGGCTGGCCAGGCACGGTCGCCATAGTCGGCAAATGGTTTGGGAACGCAAAAAAAGGTCTTATATTCGGTCTTTGGAACTCGCACACGTCATTAGGAAATATTTTAGGTACAATAACTGCAGCCAAGTATGTGGAATACGACTGGTCTCTGTCATTCATCTATCCGGCCTTAATAATGGGGGTAGTTGCGTTTATTGTTTTCCTGTTCCTGGCTCCTGAGCCGAAGTATGTCGGCATCACGACTGAGAGGATATCTCCAAGTAGAGTGTCCCACAGTTCAGACGAAGATGTGTCCGAAGTGATCGTTGGTGATCAGTATATACTCCTTCAATCCGCCTCTTTGTTTGAACACTGCTTTGTCTATGCCCCACTCATTAACAGCATGTTGCAGGCGGCGAGTCTCCGACACTCACGATACTCGGCCAACACGCATCACTCAGACGAGGTGACAGAGGAAACGGGTCTTCTATCTAACCGGCCGAGAGCTGGTGCTGTGTCCCTGACACGTGCGCTGGCTATACCCGGAGTGATAGAGTTTTCACTGTCACTTTTCTTCGCTAAACTAGTCAGCTACACCTTCCTCTACTGGCTGCCGATGTACATTAAGAGTTCCACTAATCTAACTCCAAAGCAGTCTAGCGAGTTGAGCACAGCGTTTGACGTGGGTGGCGTGGTGGGCGCGGCTCTAGCCGGGTTACTGGCGGACTGGGCGGGTTGTCCTGGTGTTGTATGTGTGGGATTTTACGCATTATGTGTGCCCACACTGCTCGCGTATCTCCAATGGGGTGCGACCACGTACATTCTGAACGTGTGCCTGCTAGTTGTGGCCGGAGTCCTAGTTAATGGACCATACGCTCTCATCACAACGGCCGTCAGCGCGGAACTAGGTACACACAGCAGCTTGGCTGGTGACGCCCAAGCATTGGCCACTGTCACGGCCATCATTGACGGTACTGGTAGTATCGGCGCTGCTGTTGGGCCACAGATGGCCGGCTTGGTATCTGGAGTCAGTTGGTCTTATGTGTTTTACATGTTGGTGATCTGCGACTTCTTGGCACTTGTTCTTCTCTTGAGGATATCAATGTCGGAGATAGCAAGGATACGACAAGAGCGCCGCCTGGCGTCGCCTCGTTTAGTTAGAATTGAATGA

Protein sequence:

>DPOGS200451-PA
MNGVSRDAPVGIQCLQKFSSWMCPRLQINRLRWYQASVLVLTYFTYMTYHLTRKPISVVKSILHQNCSGLTPPPDIRPGDDQWCNWAPFSKYQASVLVLTYFTYMTYHLTRKPISVVKSILHQNCSGLTPPPDIRPGDDQWCNWAPFNTHDANTLLGTLDSAFLFCYAGAMFISDTHDANTLLGTLDSAFLFCYAGAMFISGMIAERVDLRYFLSLGMLVSAVFCYLFGLGRTLDIHDISFYLIVQAGAGIAQTTGWPGTVAIVGKWFGNAKKGLIFGLWNSHTSLGNILGTITAAKYVEYDWSLSFIYPALIMGVVAFIVFLFLAPEPKYVGITTERISPSRVSHSSDEDVSEVIVGDQYILLQSASLFEHCFVYAPLINSMLQAASLRHSRYSANTHHSDEVTEETGLLSNRPRAGAVSLTRALAIPGVIEFSLSLFFAKLVSYTFLYWLPMYIKSSTNLTPKQSSELSTAFDVGGVVGAALAGLLADWAGCPGVVCVGFYALCVPTLLAYLQWGATTYILNVCLLVVAGVLVNGPYALITTAVSAELGTHSSLAGDAQALATVTAIIDGTGSIGAAVGPQMAGLVSGVSWSYVFYMLVICDFLALVLLLRISMSEIARIRQERRLASPRLVRIE-