Monarch geneset OGS2.0

DPOGS201102
Transcript        DPOGS201102-TA  1428 bp
Protein           DPOGS201102-PA  475 aa
Genomic position  DPSCF300137 - 432032-435129
RNAseq coverage   647x (Rank: top 20%)
Annotation
Heliconius      HMEL005318        9e-162  72.55%
Bombyx          BGIBMGA013656-TA  1e-137  55.16%
Drosophila      CG6484-PA         1e-46   26.36%
EBI UniRef50    UniRef50_B0WPL1   7e-54   30.19%  Sugar transporter n=5 Tax=Culicidae RepID=B0WPL1_CULQU
NCBI RefSeq     XP_319647.4       4e-57   32.21%  AGAP008900-PA [Anopheles gambiae str. PEST]
NCBI nr blastp  gi|158299546      8e-56   32.21%  AGAP008900-PA [Anopheles gambiae str. PEST]
NCBI nr blastx  gi|158299546      7e-57   32.06%  AGAP008900-PA [Anopheles gambiae str. PEST]
Group
Gene Ontology    GO:0055085  9.4e-62  transmembrane transport
                 GO:0016021  9.4e-62  integral to membrane
                 GO:0022857  9.4e-62  transmembrane transporter activity
KEGG pathway
InterPro domain  [18-458]  IPR005828  9.4e-62  General substrate transporter
                 [1-456]   IPR016196  1.3e-52  Major facilitator superfamily domain, general substrate transporter
Orthology group  MCL19868  Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201102-TA
ATGAAGAAATTGCGTTTCTTTTGTGAGGGCAGCCAGTTTAATCAAATACTATGTGCATTTTTGATATCTATACCAATGTTCTGTTATGGAAACACTATTGGCTGGATGTCTCCTATGACGCTTCTGCTACAATCAGATAAATCACCCAAAGGGGTGCCTCTGACAGATCTCGAGATTTCTTGGATGGCATCTTTGCCATATTTAGTGTGTGTGCCTGGTACGTATTTAATGGCTGCGATTGGAGATCGCTACGGAAGAAAATTAGCTCTTCTCATAATGTCCGGTATATCGGTGGCTGTATGGGTTTTGAAGTTAAGCTCTATGAATATCTGGGTTTTCATCATAGCGAGGATCTTAGCTGGCATTATCATGGCGGGAAGCTGCGTGACCTGTCCCACCTACATTAAAGAGATCAGCGAAGACAACATCAGAGGAGCTCTAGGTTGCTGGGGCGCGCTGTTTTTCACAACTGGAAGTTTATTTGCCTACATAATATGCGATGTCTGCAGCTACAATGTGATATTAATAATTTTCACTATCATACCGGCTGTGCATTTTGTGATATTCCTGACGATGCCAGAATCACCTTCGTATCTTATAAAGAGGGGGAGAGAAGAAGAAGCATCAAAATGTTTGCAATGGTTGAGATGCAGAAGCGAATTTGATTCAACTATTAAAAGTGAAATAGATTACGTGAAACGAGAGCAAAAGAATGATGAGGGTCGAGAACAATTTCTATTAAGAAACATTCTTTCGGACAAGATCCTCCGAAGAGCATTTCAAATATCTTTAGTAGCAGCGTTGTCCAGAGAGCTTTGTGGAGCTGTTCCAGTTTTGAATTTTGCTGGGGACATCTTCCATTTAGCCTCGGAGGAAACTGGTCTGAAACTCAGTGCCAACCAACAGGCCATGGTGCTCGGCACGGTCCAGTTGTGTGGGGCTACCTTAGCTTCTGGTATTGTCGAAAGATGTGGCCGTCGACCCCTACTCTTCGTTAGTTCAGCGATATCTGGTCTAAGCATGTGTCTATTGGCAACCTGGTTCCTGTTACAATACCTCCACCCTCCAGCATGGATTCCTGTGATAACGCTGTGCTTATGCATATTCTGTGATGCTGCTGGGCTGATGCCTATAGCTGTTGTGATAGCCAGCGAGACCTTCTCTTTTAAGTATCGAGGCACAGTATTAGCAACGACAATGGCAATAGCATCAGTGGCAGACTTTATACAGCTTTTGTTCTTCAAGCCTTTAGTGAGAGCAATTGGGATACACGTGTCATTCTACTTTTTTGGCCTGATGTGTCTACTCACGGCTGTCTACGTTATAATAATGGTACCAGAAACTAGGAACAGGAAATTAGAAGAAATTTATTACGACTTGAAGACTAACAAAGAGAAAAAAGAATTGGAGAATAGAAATGATTTTTAA

Protein sequence:

>DPOGS201102-PA
MKKLRFFCEGSQFNQILCAFLISIPMFCYGNTIGWMSPMTLLLQSDKSPKGVPLTDLEISWMASLPYLVCVPGTYLMAAIGDRYGRKLALLIMSGISVAVWVLKLSSMNIWVFIIARILAGIIMAGSCVTCPTYIKEISEDNIRGALGCWGALFFTTGSLFAYIICDVCSYNVILIIFTIIPAVHFVIFLTMPESPSYLIKRGREEEASKCLQWLRCRSEFDSTIKSEIDYVKREQKNDEGREQFLLRNILSDKILRRAFQISLVAALSRELCGAVPVLNFAGDIFHLASEETGLKLSANQQAMVLGTVQLCGATLASGIVERCGRRPLLFVSSAISGLSMCLLATWFLLQYLHPPAWIPVITLCLCIFCDAAGLMPIAVVIASETFSFKYRGTVLATTMAIASVADFIQLLFFKPLVRAIGIHVSFYFFGLMCLLTAVYVIIMVPETRNRKLEEIYYDLKTNKEKKELENRNDF-
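
The transcript and protein records above can be cross-checked against each other: 1428 bp is 476 codons, that is, 475 residues plus a stop codon. The following is a minimal sketch of that check, assuming the transcript is a complete coding sequence read in frame from its first base and that the standard genetic code applies. The names `CODON_TABLE` and `translate` are illustrative and not part of the database; the stop codon is rendered as `-` to match the notation used in the protein record.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
# "-" marks a stop codon, matching the notation in the protein record.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY--CC-W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, one residue per codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 30 and last 15 bases of DPOGS201102-TA, copied from the record.
head = "ATGAAGAAATTGCGTTTCTTTTGTGAGGGC"
tail = "AGAAATGATTTTTAA"

print(translate(head))  # MKKLRFFCEG
print(translate(tail))  # RNDF-

# Length check: 475 residues plus one stop codon account for 1428 bp.
print(3 * (475 + 1))    # 1428
```

To check the whole entry, pass the full DPOGS201102-TA sequence to `translate` and compare the result with the DPOGS201102-PA sequence.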