Monarch geneset OGS2.0

DPOGS213304
Transcript: DPOGS213304-TA, 1407 bp
Protein: DPOGS213304-PA, 468 aa
Genomic position: DPSCF300130 - 520120-523770
RNAseq coverage: 156x (Rank: top 52%)
Annotation
Heliconius: HMEL011881  8e-109  60.76%
Bombyx: BGIBMGA005604-TA  1e-73  37.59%
Drosophila: CG1213-PC  9e-53  30.09%
EBI UniRef50: UniRef50_Q7QJE9  8e-69  36.36%  AGAP007483-PA n=3 Tax=Culicidae RepID=Q7QJE9_ANOGA
NCBI RefSeq: XP_308390.4  1e-69  36.36%  AGAP007483-PA [Anopheles gambiae str. PEST]
NCBI nr blastp: gi|380024226  2e-68  34.71%  PREDICTED: facilitated trehalose transporter Tret1-like [Apis florea]
NCBI nr blastx: gi|380024226  3e-72  34.93%  PREDICTED: facilitated trehalose transporter Tret1-like [Apis florea]
Group
Gene Ontology: GO:0055085  5.1e-68  transmembrane transport
    GO:0016021  5.1e-68  integral to membrane
    GO:0022857  5.1e-68  transmembrane transporter activity
    GO:0016020  5.1e-05  membrane
    GO:0022891  5.1e-05  substrate-specific transmembrane transporter activity
KEGG pathway 
InterPro domain: [21-450]  IPR005828  5.1e-68  General substrate transporter
    [1-455]  IPR016196  1.8e-51  Major facilitator superfamily domain, general substrate transporter
Orthology group: MCL34973  Lepidoptera specific

Nucleotide sequence:

>DPOGS213304-TA
ATGACAAAAACAAATAGAAAAGTGCAATATTTGGCGGGTTTGTGTGTATCATTGGCATTCACGTTTACTGGGGCTGTAAATACTTGGGCTTCACCTGCGATTCCAAAGTTCAAAAATGGCGATGCGAACATTGTTATTTCAGACGCACAAACATCGTGGGCAGTATCGGTATCGGCCTTGGGGTCATTGCCAGGATGTTACTTTGGCCGGGAGCTGAGCGAACGTGTAGGACGTCGAAAAACTATAATCCTGGCCGCAGTTCCAGGATTTGTAGGTGCGATGATCATTCTATTTACGAAATCCCCATTGCTGATGTGTTTTGCAAGGATTCTGATGGGGATTGCAAATGGCATCACTGCAGTTGTTACAATGATTTATTTGACAGAAATAGCGGATAAGGAAATTAGAGGAGCCTTGGGAATGTTGGTACAGGTTATGAACAATTTGGGAAGTTTGGTCCTTTATGGTATAGGGCCGTTCGCGTCGTACAACGTGTTGAACTTGATAGTACTTTTTATATCTGCGTTCTTCGCTCTGTTATGTCTGTGGGTACCCGAATCTCCTTATTACCATTTGGCGAGAGGAAACGTAGCTGCCGCAAAGAAATCATTTTTGTTTTTGAAAGGTTCTAAGGACAGTAAGTGGGCTGATGAACAAATGGGTATAATGAGGGTGCACGTTCAAGAGAGTATGGAGAATAGAAGCACTCTCAGAGAATTGATCAGTAACATGAAATATAGGAGAGCTATCTACATCATCGCTGGCTTAAAAGTCTTGCAGTATATGACCGGTAGTTTGGCTATACAAGCTTATTTGGAGGTAATATTCCGTCAGAGCAGTTCAATATCGGGGCCGTACGCTAGTATTGTTTATGGATTCGTCCAACTCGGTGCAGGTATCGGAGCTACGTTTCTGGCTGGATATTTCGGTAGACGGATTCTCATGCTGTTTTCGAGCCTCGGTGTTGCCATGTCGCTAACAATAGTCGGTGTATATTTTTTCTTAAAAGACTCTGTAGTCGTGAACAAGGAGGTTTTATCATCAATATCTTCGTTGCCATTAATCGGGGTTCTGGGTTTTAATGTTTTGTATGCAGCCGGTTTAGGAAATTTGCCTTATATAATGCAAGCCGAGCTGTTCCCTATGAACGTTAAAGCGATCGCTTCTAGTATGGCGACTATGCTCGCTTGTGTGCTGGCGTTTTCCGTGACTAAGTCCTATCAAGGTATCAAGGATGTTTTCGGTCACTACACGGTGTTTTGGTCCTTTGCCGCCGTCGCTGGCTTCGGAGTGTTCTTCATATATTTCTTCGTCCCCGAAACTAAGGGGAAAACGTTAGAGGAAGTCCAAGACAACATGCAAGAGGCAGTCGTAGAAATAGAAAGACTAAACAAGACGGATAATTGA

Protein sequence:

>DPOGS213304-PA
MTKTNRKVQYLAGLCVSLAFTFTGAVNTWASPAIPKFKNGDANIVISDAQTSWAVSVSALGSLPGCYFGRELSERVGRRKTIILAAVPGFVGAMIILFTKSPLLMCFARILMGIANGITAVVTMIYLTEIADKEIRGALGMLVQVMNNLGSLVLYGIGPFASYNVLNLIVLFISAFFALLCLWVPESPYYHLARGNVAAAKKSFLFLKGSKDSKWADEQMGIMRVHVQESMENRSTLRELISNMKYRRAIYIIAGLKVLQYMTGSLAIQAYLEVIFRQSSSISGPYASIVYGFVQLGAGIGATFLAGYFGRRILMLFSSLGVAMSLTIVGVYFFLKDSVVVNKEVLSSISSLPLIGVLGFNVLYAAGLGNLPYIMQAELFPMNVKAIASSMATMLACVLAFSVTKSYQGIKDVFGHYTVFWSFAAVAGFGVFFIYFFVPETKGKTLEEVQDNMQEAVVEIERLNKTDN-