Monarch geneset OGS2.0

DPOGS211152
TranscriptDPOGS211152-TA1506 bp
ProteinDPOGS211152-PA501 aa
Genomic positionDPSCF300007 + 8964-19580
RNAseq coverage165x (Rank: top 51%)
Annotation
HeliconiusHMEL0171890.085.09% 
BombyxBGIBMGA003138-TA0.081.25% 
DrosophilaMct1-PA4e-15050.68% 
EBI UniRef50UniRef50_Q9W5096e-14850.68%EG:103B4.3 protein n=8 Tax=Pancrustacea RepID=Q9W509_DROME
NCBI RefSeqXP_002040477.14e-14950.85%GM19211 [Drosophila sechellia]
NCBI nr blastpgi|1953478768e-14850.85%GM19211 [Drosophila sechellia]
NCBI nr blastxgi|1700678402e-14953.07%monocarboxylate transporter [Culex quinquefasciatus]
Group
Gene OntologyGO:00550851.7e-27transmembrane transport
GO:00160211.7e-27integral to membrane
KEGG pathway 
InterPro domain[1-495] IPR0161961.8e-59Major facilitator superfamily domain, general substrate transporter
[8-391] IPR0117011.7e-27Major facilitator superfamily
Orthology groupMCL16099 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211152-TA
ATGGTCGTGTTCGCTTCCTTTATGATTCATATTGTCACCGATGGTATGACTTATTCGTTCGGCGTCTTCTACGCCGAGTTTCTCACCTACTTCAACGAGGGACAGAGCAAAACGGCGTGGATCGTCTCAATCCTTGTGGGGGTCACTCTCAGTTCAGGTCCTGTATCTAGTTCCTTTGTGAACCGCTGGGGTTGTCGGACTGTTACTGTGGCTGGTGCGTTACTGTCGTCGGTCTGTGTGGTCGCATCGGCGTTCGCTAAAAACGTTACAACCCTTATATTTACGATTGGCGTAGGAGCCGGAGTAGGATTCGGCCTCATCTACCTACCCGCCATTGTTAGCGTCACTGTATGGTTTAAAAGATATAGGAGTCTCGCCACAGGGATCGCCGTTTGTGGTTCAGGACTGGGAACATTTCTTTTCGCCCCCATAACTGACGCTTTAATATCAAATTATGGATGGCGTGGAGCTATGGCGCTCATTGGAGCCTTCATACTTAACTGTGTTCCTCTGGGGCTCATGTTCAGAGCAGTTCCTGAAAGACCGATCACTCCAATCACGGAGCCAATGCTACCGAAGCAAAACAAATCACCTTTAAAAAGATCCCAAAGCTCGGAACAGGTCGTTCGGGCGAACGGAAAATTTGAAGATGATGTAGCTAGATTAACACTTTCCCAACCTGCACTAAATAAGCCACACGAACAACAAAAACCTCATTCACGCCAAGGAAGTGGAATAATGCAAAGACCTGACGTTTTATATCAAGGCAGTATGACGAGTCTCGCTAGATTCAGAGGAGTATCTCCAGAAAGAGTACAGCTTTCTTTTAAAAGAGAAGAACGCGAACAAAGCTGTGGCTGGCTGCCGTGTTCAGAAGAATCGAAAGCAGCTCTAGCAGAAATGCTCGATATATCTTTGTTAGTTGATCCTGTGTTTATATTATTCGCGTTATCAAACTTTTTAACAAGTATAGGGTTCTACATACCATATGTTTACACGGTACCAATGAGTGCCGCCTTGGGAATTGAAAATTCAGCATACCTGATATCTATAATTGGAGCATCAAACTTAGTAGGCAGAATCATTCTCGGATATATAAGCGACAAGCCGTGGGTCAACCGATTATTGGCTTACAATGTGTGCCTTACAATTGCCGGAATAAGTACTGCAATGGCGATGGCTTGCTGGGAGTTCTGGGGACTCGCTCTTTACGCGACCATGTTTGGCTTCACTATTGGTGCTTACGTGGGTCTCACTTCCGTAGTTCTTGTCGATCTTCTTGGTATAGAGAAATTGACAAACGCTTTCGGACTTCTGCTACTCTTCCAAGGAATCGCATCACTTATCGGACCACCATTTGCTGGTTGGTTGTTCGACTCTCTGAACTCCTACGCACCAGCCTTCTTCGTCGCTGGAGGGACGATATCACTAAGCGGTCTGATTCTATTCTTCATTCCGATGCTGGAGAGGCGGAGCAAAAGAGACACGACGATAGAACAAATTTAG

Protein sequence:

>DPOGS211152-PA
MVVFASFMIHIVTDGMTYSFGVFYAEFLTYFNEGQSKTAWIVSILVGVTLSSGPVSSSFVNRWGCRTVTVAGALLSSVCVVASAFAKNVTTLIFTIGVGAGVGFGLIYLPAIVSVTVWFKRYRSLATGIAVCGSGLGTFLFAPITDALISNYGWRGAMALIGAFILNCVPLGLMFRAVPERPITPITEPMLPKQNKSPLKRSQSSEQVVRANGKFEDDVARLTLSQPALNKPHEQQKPHSRQGSGIMQRPDVLYQGSMTSLARFRGVSPERVQLSFKREEREQSCGWLPCSEESKAALAEMLDISLLVDPVFILFALSNFLTSIGFYIPYVYTVPMSAALGIENSAYLISIIGASNLVGRIILGYISDKPWVNRLLAYNVCLTIAGISTAMAMACWEFWGLALYATMFGFTIGAYVGLTSVVLVDLLGIEKLTNAFGLLLLFQGIASLIGPPFAGWLFDSLNSYAPAFFVAGGTISLSGLILFFIPMLERRSKRDTTIEQI-