Monarch geneset OGS2.0

DPOGS216072
TranscriptDPOGS216072-TA1512 bp
ProteinDPOGS216072-PA503 aa
Genomic positionDPSCF300067 + 379341-395961
RNAseq coverage771x (Rank: top 17%)
Annotation
HeliconiusHMEL0089340.076.33% 
BombyxBGIBMGA008871-TA0.084.00% 
DrosophilaCG9864-PA2e-11745.80% 
EBI UniRef50UniRef50_D6WVB62e-12846.72%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WVB6_TRICA
NCBI RefSeqXP_311575.44e-13048.49%AGAP010370-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|3123823115e-13246.75%hypothetical protein AND_05066 [Anopheles darlingi]
NCBI nr blastxgi|3123823113e-13347.99%hypothetical protein AND_05066 [Anopheles darlingi]
Group
Gene OntologyGO:00550856e-54transmembrane transport
GO:00160216e-54integral to membrane
KEGG pathway 
InterPro domain[1-485] IPR0161962.8e-76Major facilitator superfamily domain, general substrate transporter
[23-430] IPR0117016e-54Major facilitator superfamily
Orthology groupMCL14888 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS216072-TA
ATGAAGTTCCACATAAAAATATTTGATGTTATACCGGCGAGGTTAAACGTGGCGTTAATGATGTTCTTCGCCTGCTGGGTCAATTACATGCTGCGCGTTAACATGAGTGTCAATATCATTGCCATGGTTCCTGATCGTGGTGAAACTAAGTCGGTACAAAGCGAATGTGAGGCCATCACTAATGACGACACTGCATTACATAATGGTACGACAGCGGTTACACGACAAGTCCAACAGCCCGATGGATCTATTACTTTTGATTGGACAGCGCAACAACAGGCATACGTGCTCTCCGGATACTTCTGGGGTTACGCAATCACTTGCCTCTTCAGTGGTATAGCAGCGGAGAGATGGGGTCCAAGGAATGTAGTCTTCATATCCATGTTGATATCTGCGGTCCTCACAATTCTCATTCCCCCAGCAGCGAAAGTACATTTCATGATGCTAGTAGCTACAAGATTCGTCATAGGTCTTGCTGCGGGCTTCCTTTTCCCGTCACTACACGCGCTTGTTGCTCATTGGGCTCCTCCAGCAGAGAAGGGGAAGTTTGTGAGCGCTCTCCTAGGAGGGGCCATAGGAACCGTCGTGACCTGGTCGCTTAGTGGACCTCTTATTGAGAACTTTGGGTGGACTTACGCATTTTATGTACCAGGTATTATAGCCATCGTTTGGTGTGCTGCGTGGTGGTTCCTTGTATACGATTCCCCCGTCATACATCCACGTATTAGCGAAGAAGAAAAAACGTACATCCTAAGTGCCATTGGAGACAAAGTGCAACAGAGTTCCAAGGAGCATAAAATTGTTCCACCGTTTAAAGATATATTCACGTCGTTTCCGTTCCTCGCCATGGTTATCCTCCACTATGGTAACACATGGGGTATATACTTCGTAATGACAGCCGCACCAAAATACGTGTCAAGTGCTCTCGGATACAATTTGACTTCAACGGGCACTCTGTCATCACTACCTTACCTTGCGAGGATGATATTTTCATTAATATTCGGAGCTATTGGTGACAGAATCGTCAAACAGAACGTTGTATCCACGACGTTTATGAGGAAGTTCTTCTGCTTGTTTTCCCACGTGGTGCCGGGTCTGCTGCTCATCGGTCTGGGCTACACGGGCTGTGCCCCCATCTTGTCAGTGGCTCTTATAACATTCTCCATGGGCTCCAATGGCGCCGCCACACTCACTAACTTAGTGAACCACCAGGATCTGGCGCCAAACTTTGCCGGCACCATTTACGGCATAGCCAATGGTATTGGTAACACAGCTGGTTTCATAACACCGCTTGTGACTGCCTACTTCACCAAACATGGGAATGGTTTTGCGGAATGGCGGCCAGTTTTCCTCACGGGAGCCTCAATATACATTGCCGCAGCAGTTTACTTCATTCTCTTCGGCACCGGTGAAACACAATCGTGGAATTACGTCGCCCCGGCGGAAGACGATAGGGACAAGAGGCCCAATAACAGCGAAGATACCACCGTCAACATACCAGTTAAAACATAA

Protein sequence:

>DPOGS216072-PA
MKFHIKIFDVIPARLNVALMMFFACWVNYMLRVNMSVNIIAMVPDRGETKSVQSECEAITNDDTALHNGTTAVTRQVQQPDGSITFDWTAQQQAYVLSGYFWGYAITCLFSGIAAERWGPRNVVFISMLISAVLTILIPPAAKVHFMMLVATRFVIGLAAGFLFPSLHALVAHWAPPAEKGKFVSALLGGAIGTVVTWSLSGPLIENFGWTYAFYVPGIIAIVWCAAWWFLVYDSPVIHPRISEEEKTYILSAIGDKVQQSSKEHKIVPPFKDIFTSFPFLAMVILHYGNTWGIYFVMTAAPKYVSSALGYNLTSTGTLSSLPYLARMIFSLIFGAIGDRIVKQNVVSTTFMRKFFCLFSHVVPGLLLIGLGYTGCAPILSVALITFSMGSNGAATLTNLVNHQDLAPNFAGTIYGIANGIGNTAGFITPLVTAYFTKHGNGFAEWRPVFLTGASIYIAAAVYFILFGTGETQSWNYVAPAEDDRDKRPNNSEDTTVNIPVKT-