Monarch geneset OGS2.0

DPOGS203600
Transcript         DPOGS203600-TA   1410 bp
Protein            DPOGS203600-PA   469 aa
Genomic position   DPSCF300063 - 583355-587742
RNAseq coverage    333x (Rank: top 35%)
Annotation
Heliconius        HMEL008879         0.0      71.28%
Bombyx            BGIBMGA007270-TA   0.0      87.71%
Drosophila        l(2)01810-PA       4e-141   51.93%
EBI UniRef50      UniRef50_B2DBK3    1e-140   56.36%   Similar to CG5304-PA n=3 Tax=Papilionoidea RepID=B2DBK3_9NEOP
NCBI RefSeq       XP_317786.3        7e-156   56.90%   AGAP007732-PA [Anopheles gambiae str. PEST]
NCBI nr blastp    gi|312379474       5e-155   56.96%   hypothetical protein AND_08672 [Anopheles darlingi]
NCBI nr blastx    gi|312379474       7e-157   59.51%   hypothetical protein AND_08672 [Anopheles darlingi]
Group
Gene Ontology     GO:0055085            3.3e-57   transmembrane transport
                  GO:0016021            3.3e-57   integral to membrane
KEGG pathway
InterPro domain   [1-437] IPR016196     2.2e-75   Major facilitator superfamily domain, general substrate transporter
                  [6-401] IPR011701     3.3e-57   Major facilitator superfamily
Orthology group   MCL10166              Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203600-TA
ATGGCTATCATGGGTTTCCTGGCCGTGGCGAACGCGTACACCATGCGTGTCTGCCTGAACATTGCTATCACACAGATGGTGAGGCGACATTCCCCCTCACCAGCCTACGAAGACGGCTCCTGCCCTGGGGGTTTGGACGAAGTTGTTGAAGATGCAACGGAGATCACTGGGTATAACTGGAATGAGGAGACACAAGGAATCATATTAAGCGCGTTCTACTATGGTTACATTGTTACTCATTTACCTGGAGGTATGCTGGCAGAACGATTCGGAGGCAAATATTCCCTTGGCTTCGGAGTACTCAGCACAGCTGTGTTTACTCTCTTAACGCCCTGGGCTGTGAACCTGGGAGGCGCCACGGGGCTTATTATTTTAAGAGTTCTTGAAGGACTCGGAGAGGGAACGACATTCCCTGCATTAAATGCCATGTTGGCTCGATGGTCTCCCGTGTCCGAGAGAGGACGGATGGGGTCCCTAGTATTCGGGGGGGCTCAAATAGGAAACATTGCTGGAACTTACTTCTCTGGACTTGTCATCAAGGAGACTGGCGAGTGGCAATCAGTTTTCTACTTGTTTGGGAGCATTGGGATCCTATGGTTTATTTTATGGGCTCTTCTTTGCTATAATGATCCAGAATCCCATCCTTACATATCTGATAAGGAAAAGAAATATTTAGAAGAAGCGCTCGGAAGACACCACAACACTCAGCCCTCGTCCATTCCATGGAAAGCTATTTTCATGTCAGTTCCACTATGGGCTTTAGTTTGCGCACAGATCGGACATGACTACGGCTACTTCACAATGGTCACCGACCTACCCAAGTACATGACGGGTGTATTGAAGTTCGACATCCATCGCACTGGCACCCTGGCTGCGCTGCCATACGCCGTCATGTGGCTGAGCTCCATCGCTTTCGGTTGGATATGCGACAAAATAGTCAAGAGGCAATGGATGACAGTCACAAACGCTAGAAAGACCTTCACTACCATTGCGTCTGTCGGACCGGGTATTTGTATGATCCTAGCTTCCTACTCAGGCTGTAATACTGAGACTGTTGTGATACTGTTTACCGCGTCCATGGGCCTGATGGGAGCTTTCTATCCGGGCATGAAAGTGAATGCGTTGGACTTGAGCAACAACTATGCTGGTACCATTATGGCGATCGTGAACGGCATTGGAGCGATTACAGGCATCATAGCCCCCTACTTGGTCGGATTGTTGACACCTGATAGTACGTTAACTCAATGGAGATTGGTGTTTTGGATCACTCTGGCGGTGTTCATAGTGACAAATTTAGTGTTCGTGGCGTGGGCGTCTGGCGAGGGACAGTGGTGGGACACCTGCTCCCAGGATCCGAGGAAGCAAGACGAGAATACGAATCAATCAGTCAACAATGACGTCAAATTGTAA

Protein sequence:

>DPOGS203600-PA
MAIMGFLAVANAYTMRVCLNIAITQMVRRHSPSPAYEDGSCPGGLDEVVEDATEITGYNWNEETQGIILSAFYYGYIVTHLPGGMLAERFGGKYSLGFGVLSTAVFTLLTPWAVNLGGATGLIILRVLEGLGEGTTFPALNAMLARWSPVSERGRMGSLVFGGAQIGNIAGTYFSGLVIKETGEWQSVFYLFGSIGILWFILWALLCYNDPESHPYISDKEKKYLEEALGRHHNTQPSSIPWKAIFMSVPLWALVCAQIGHDYGYFTMVTDLPKYMTGVLKFDIHRTGTLAALPYAVMWLSSIAFGWICDKIVKRQWMTVTNARKTFTTIASVGPGICMILASYSGCNTETVVILFTASMGLMGAFYPGMKVNALDLSNNYAGTIMAIVNGIGAITGIIAPYLVGLLTPDSTLTQWRLVFWITLAVFIVTNLVFVAWASGEGQWWDTCSQDPRKQDENTNQSVNNDVKL-