Monarch geneset OGS2.0

DPOGS212266
TranscriptDPOGS212266-TA2229 bp
ProteinDPOGS212266-PA742 aa
Genomic positionDPSCF300077 - 490027-514056
RNAseq coverage1529x (Rank: top 8%)
Annotation
HeliconiusHMEL0149280.073.67% 
BombyxBGIBMGA011576-TA0.076.61% 
DrosophilaCG13907-PA0.055.80% 
EBI UniRef50UniRef50_B0W6P20.058.28%Monocarboxylate transporter n=8 Tax=Pancrustacea RepID=B0W6P2_CULQU
NCBI RefSeqXP_312912.30.058.47%AGAP003206-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|3479694640.058.47%AGAP003206-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|3479694640.058.64%AGAP003206-PA [Anopheles gambiae str. PEST]
Group
Gene OntologyGO:00550853.1e-21transmembrane transport
GO:00160213.1e-21integral to membrane
KEGG pathway 
InterPro domain[25-698] IPR0161967.9e-51Major facilitator superfamily domain, general substrate transporter
[71-256] IPR0117013.1e-21Major facilitator superfamily
Orthology groupMCL15970 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212266-TA
ATGTCAAAGGTTTCTCTTAGTAATATACCATCGAACCCCCGTATAAATCGTTTGAGAACCTTTTCAAGGACTGAAAGCGAGACATCATGCGAAGAGGCGGCGCGTCTGACAGCGGAAGGTGCAGCTGATGACGATGATGAAGCCTACGACTACGGGGAGCTGCCACCACCACCAGACGGCGGCTACGGCTGGGTGGTGGTGTTCGCCTCCTTCATGTGCAACCTGGTGGTTGACGGTATCGCATACACCTTCGGAATATTCCTCCCCGAACTGGTCACGTACTACGGCGAGGGTAAAGGAACCGTAGCGTGGGTTGGAAGTCTGCTATCTGGAGTGTATCTTGCGGCTGGTCCTATTGTATCTGCGTTATGTAACAAATATGGCTGTCGAGCGGTGTGTGTTGCTGGCAGTTTGGTCGGCTCTGTTGCCTTCGTTCTGTCAACCTTCAGCAAGAGCGTCACTATGATGATGATCACATATGGACTTATTGGAGGCATGGGTTTCGGTATGATCTACCTGCCGTCTGTGGTTGCTGTGGGTTATTATTTCGAAACCCGAAGGTCACTAGCCACCGGTATAGCTGTGTGTGGTTCAGGCGTGGGTACCTTCAGCTTCGCCCCTCTAGCTGCAATCTTGCTCAACTACTTCGGATCCTGGCAGAATGCTAATTTACTACTGGCCGGTTTGATATTAAACTGCGCTGTATTCGGGGCCCTGATGAGACCGCTTGTTTATCCCAAAACATCAGGCGAAAAGCCTCTCCTCCAAAGAATGGCGGAAGAAAAGAGGTTGCAAATGGAAAGAGGCTCTATAGGCGGCTCGTACTTCGTTGTGCAATTACCGGATGGTACAATGGAGAAAAGATTAAAAGCGCCACTGAATATTGATCCAGGTGTACATTCTTCCCTGAATCTGGAAGCTTTAGCTCGTGTACCCACCATCCCAAACATGCCGGGTGTACCCACAGTGCCAACACTTCCAACTATCACCGAGGCAAGAGTGGTCGATGAAAACGTTGACAAGAAAAAGAACGAAAACGGCAGCGCACTAAGTCCTAGCCAGCAGGAGCAGCAAGCTATGTCCAGGAATGTTTCGTCTCCAGCCTTCAGCGCCACGGCTCCGGGAATACCTAAAAACGGTTCCGTGCCTTTCTTCGACCGTCAGCGGAAACACAGCTCCGCCGACAGATTTAAACCGTCCCTAGCTGCTATAAAAGCCACGTCCAAAACTTCAATGAGCAGCCACCGCGGCGACGGTGATGCGGAGAGCAACATGTATACATCAAAATTATCAGTATCCGCCAAGGAGCCGTCCCGCATGGTTCGTCCGCTGTCCCGTAAGGACATCTTCTACTCCGGGTCCGTCATCAACCTGCCGCAGTATCAGAGCCAGAAGTCGCTCCAGGGCTACAGAAACTCGGTGCTGAGTCTACCGCAGAGCAGGCAGGCTGGGGATCTCGAACGACAAGAACAATACGATTTATGCCCATGCCTGTCGCTGCCCGAATCCTTCAAAGCGGCCTTATCCTCAATGCTCGACGTGAGCCTGCTGCGAGACCCCGCCTTCATGTTGATAGGAGTGTCCAACGTGTTCGGTATGGCCGGGCTGTATGTACCCTTCGTGTACATAGTGGACGCCGCTCAGATGACTATCTCTGTCGGCGTTACTCCATTCTGCACCACCTACGCGGCGTACGTAGCTGTTGCGATTGCTTTTGGAATTGCTATTTCTGGCTATATATCTCTGACGTCCATCATCCTGGTCGACCTGCTGGGTCTTGACAAGCTGACGAACGCCTTCGGTCTCCTGATCCTGTTCCGTGGAGCCGCTGCCATCATAGGCTCGCCCCTGGCCGGCGCCGTCTACGACGCTACGAGGAACTACGACGCCTCCTTCTACATGGCCGCAGCTTTCTTCCTGGCCGCTACGCTGACGTCATTCGCCGCACCCATGTTCCGAAGGCACGTATTGATTATGGTTTTTCTATTTATCCGACCCGTTTATCTAGAAAGAAAACAAGAGGAGGTCCAACAACCGATGGACGTGCTGACTCCAATAGATGAGGATTTGGAAGAAGGTGAGGAAGACGATCCTGAAGACACCCCCATAACTATGGGCGCTCATTTAGCGTCGAGGCCTCCTGCCATCACGAGAACAGCCGCCTCGCCTTCCGACCCTCCCTCCCCTAGTCCGCCAGAGGAGCGCCCTCAGAGGGAGAGCGTCCTGTGA

Protein sequence:

>DPOGS212266-PA
MSKVSLSNIPSNPRINRLRTFSRTESETSCEEAARLTAEGAADDDDEAYDYGELPPPPDGGYGWVVVFASFMCNLVVDGIAYTFGIFLPELVTYYGEGKGTVAWVGSLLSGVYLAAGPIVSALCNKYGCRAVCVAGSLVGSVAFVLSTFSKSVTMMMITYGLIGGMGFGMIYLPSVVAVGYYFETRRSLATGIAVCGSGVGTFSFAPLAAILLNYFGSWQNANLLLAGLILNCAVFGALMRPLVYPKTSGEKPLLQRMAEEKRLQMERGSIGGSYFVVQLPDGTMEKRLKAPLNIDPGVHSSLNLEALARVPTIPNMPGVPTVPTLPTITEARVVDENVDKKKNENGSALSPSQQEQQAMSRNVSSPAFSATAPGIPKNGSVPFFDRQRKHSSADRFKPSLAAIKATSKTSMSSHRGDGDAESNMYTSKLSVSAKEPSRMVRPLSRKDIFYSGSVINLPQYQSQKSLQGYRNSVLSLPQSRQAGDLERQEQYDLCPCLSLPESFKAALSSMLDVSLLRDPAFMLIGVSNVFGMAGLYVPFVYIVDAAQMTISVGVTPFCTTYAAYVAVAIAFGIAISGYISLTSIILVDLLGLDKLTNAFGLLILFRGAAAIIGSPLAGAVYDATRNYDASFYMAAAFFLAATLTSFAAPMFRRHVLIMVFLFIRPVYLERKQEEVQQPMDVLTPIDEDLEEGEEDDPEDTPITMGAHLASRPPAITRTAASPSDPPSPSPPEERPQRESVL-