Monarch geneset OGS2.0

DPOGS204552
TranscriptDPOGS204552-TA1431 bp
ProteinDPOGS204552-PA476 aa
Genomic positionDPSCF300297 + 246105-257346
RNAseq coverage708x (Rank: top 18%)
Annotation
HeliconiusHMEL0165780.078.36% 
BombyxBGIBMGA004436-TA9e-11747.95% 
DrosophilaCG1213-PC2e-5633.63% 
EBI UniRef50UniRef50_B0WPL13e-5730.84%Sugar transporter n=5 Tax=Culicidae RepID=B0WPL1_CULQU
NCBI RefSeqXP_319647.43e-6332.65%AGAP008900-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|1582995466e-6232.65%AGAP008900-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|1582995462e-6332.65%AGAP008900-PA [Anopheles gambiae str. PEST]
Group
Gene OntologyGO:00550852.7e-60transmembrane transport
GO:00160212.7e-60integral to membrane
GO:00228572.7e-60transmembrane transporter activity
KEGG pathway 
InterPro domain[62-458] IPR0058282.7e-60General substrate transporter
[12-457] IPR0161962e-52Major facilitator superfamily domain, general substrate transporter
Orthology groupMCL34554 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204552-TA
ATGTTAAAATATACAGCCAAAAAATATTATTCGGAAGGGAGCAAATTAAATCAAGTCGTGGTGGCAGTTTTGATGGTGCTGCCAGTGTTTTCCTATGGCATGGCCGTCGGTTGGTTGTCACCGATGGGTCCCTACCTCATGTCCGAGGACACCCCGGCAGCGAAGCCTGTTCATCCTGACGTCATATCCTGGATGGCCTCCGTCGCGTACCTGGTTGGAACTCCAGCTGTATTCCTGTTCGGTTATATTGTTGACAACTTTGGACGAAAAAAAGCGCTAATGTTGACTTCGTTTTCAATGGCGGTCTGTTGGGGTTTGAAACTATACTCGACTGAGACCTGGGCGTTGATAACAGCGAGAGCGATCGTTGGTTTTGGAGTTGGAGGCTCGTACGTCGTCACACCGTTGTACATTAAGGAAATAAGCGAAGACTCCATCCGTGGCACCCTGGGCAGTCTCGTGATATTATCGCAAAATTTTGGAAACTTGGTTGTATATATATTGGGAGAATATGTATGTTATCACGCCACCTTATGGATCTGCCTCGCTGTACCGCTAATACATCTGCTTGTGTTCCCCGCTATGCCAGAGACGCCTTCTTATTTGTTGAAGAGCGGAAAGGTCGAGGAGGCGAGGAGTGCCTTAGCTTGGCTGCGCTGCCGGCAGACTGGCGACGCTAACGTGGACACTGAACTACAATCGCTTCTCCTGGAGTTAGAACAGAGCAACTCCGGCAAATTCTTCACCACTTTAAAAACATTAGTATCGGATCCGAGCACTTTCCACGCGTTCCGTATAACTCTCACAATAACCCTGGCCCGTGAGCTCTGCGGGTGCCTGGCCGTGCTGCACTTCGCGTCTCTCATCTTCAGTAAAGCTAGTGGGGATTGGGTGTTGACAGCTAATCAGCAAGCTACCATCCTTGGTGTGGTGCAGCTTATAGGGTCATGTACAGCTTCCAGCTTAGTAGAGAGAACCGGCAGGAAGCCACTGTTAGGAGCGACCTGTCTAGTATCAGGCCTGGCACTAGTGTCTTTGGGCGGCTGGTTCTTGTGGGCGGGCGGCGTCGCAGCCTGGTTGCCAGCCTTCGCCCTTTGCCTCTGTATCTACTGCGATGCGGCTGGATTACAACCCGTACCTTTCGTCGTCATGACCGAAATGTTCTCATTCCAGTATCGCGGCACGGTAACGTCGATAGTGATAGCGTTCGCGTGTGCACTAGTTTCTATCGAGCTCCGTTTGTTCCATCCCCTGGCCACTCACCTCGGTCTGTACGTCATATTCTGGATCTTCGCAGCCGTCTGTCTCATCAGTACCGTGTACATCGTGTTCTGCGTCCCGGAGACTAAAAAGCGATCCATTGATGAGATATACGCTGAACTCGGCGGGAAGAAGAACAAAGACTGTGAAGCAGCAGTCACAAGGCTCTAG

Protein sequence:

>DPOGS204552-PA
MLKYTAKKYYSEGSKLNQVVVAVLMVLPVFSYGMAVGWLSPMGPYLMSEDTPAAKPVHPDVISWMASVAYLVGTPAVFLFGYIVDNFGRKKALMLTSFSMAVCWGLKLYSTETWALITARAIVGFGVGGSYVVTPLYIKEISEDSIRGTLGSLVILSQNFGNLVVYILGEYVCYHATLWICLAVPLIHLLVFPAMPETPSYLLKSGKVEEARSALAWLRCRQTGDANVDTELQSLLLELEQSNSGKFFTTLKTLVSDPSTFHAFRITLTITLARELCGCLAVLHFASLIFSKASGDWVLTANQQATILGVVQLIGSCTASSLVERTGRKPLLGATCLVSGLALVSLGGWFLWAGGVAAWLPAFALCLCIYCDAAGLQPVPFVVMTEMFSFQYRGTVTSIVIAFACALVSIELRLFHPLATHLGLYVIFWIFAAVCLISTVYIVFCVPETKKRSIDEIYAELGGKKNKDCEAAVTRL-