Monarch geneset OGS2.0

DPOGS204659
Transcript: DPOGS204659-TA (1380 bp)
Protein: DPOGS204659-PA (459 aa)
Genomic position: DPSCF300170 - 475825-483014
RNAseq coverage: 370x (Rank: top 32%)
Annotation
Heliconius:       HMEL008246        0.0      78.73%
Bombyx:           BGIBMGA007462-TA  0.0      75.46%
Drosophila:       CG4726-PA         5e-151   59.16%
EBI UniRef50:     UniRef50_Q9VPX2   7e-149   59.16%   CG4726 n=25 Tax=Pancrustacea RepID=Q9VPX2_DROME
NCBI RefSeq:      XP_318686.3       2e-174   66.29%   AGAP009649-PA [Anopheles gambiae str. PEST]
NCBI nr blastp:   gi|158298512      3e-173   66.29%   AGAP009649-PA [Anopheles gambiae str. PEST]
NCBI nr blastx:   gi|158298512      8e-172   66.29%   AGAP009649-PA [Anopheles gambiae str. PEST]
Group:
Gene Ontology:
    GO:0055085   1.5e-52   transmembrane transport
    GO:0016021   1.5e-52   integral to membrane
KEGG pathway:
InterPro domain:
    [1-439]    IPR016196   4.3e-63   Major facilitator superfamily domain, general substrate transporter
    [62-431]   IPR011701   1.5e-52   Major facilitator superfamily
Orthology group: MCL16318 Insect specific

Nucleotide sequence:

>DPOGS204659-TA
ATGGACGTACCAGCAAAAGGAAATATATTGGGGCGTTTTGTACCAGCCCGGTATATCCTCGCGATCCTGGGCTCTTTAGGGATGGCCATAGTTTACGGGCTCAAGGTCAATCTGTCGGTGGCTATGGTGGGCATGTTGAATCATACCGCGATCAAATCTATGGAACACCATAACACGGAATTCAATTCTACCGTCTCTGATGTCGAATGCCTGCCGGCTAAGAATGACACACATGGAGAGGAAGCCGACGGTCCATTTACTTGGTCGTCTGAAGTTCAGGGTATTGTTCTCAGTTGCTACTTCTGGGGCTACTTCATTTCTCAAATTCCTGGTGGTCGCATAGCGGAGTTATTTTCCGCTAAATGGGTGATGTTCTTCAGTGTTGCAATCAACGTCGTGTGCACGCTGCTGACTCCTGTCATGGCAGAGTTGCACTACCTGGCAGCCGTGGTGATGAGAGTGGGCGAGGGTATCGGAGGGGGCGTGACGTTCCCTGCGATGCACGTGTTGCTGTCTCGCTGGGCGCCGCCCGCTGAGCGGTCGTTGCTGTCGGCTCTGGTCTACGCTGGCACGAGCCTGGGCACCGTCGCGTCTATGCTGCTCGCCGGTTTACTCACCGCAACCGCTGGTTGGGAGAGCGTATTCTACGTGATGGGCGGTCTGTCCGTACTGTGGTGCGGTTTGTGGGTGACGTTAGTGGCGGACGATCCCAGAACACAGAGACTCATCAGTTTAGAGGAGAGAGAGATGATTGTTAACTCTCTGGGGAGGAAAACTGCCAGCGCTGAACGAAAGAAGCTGCCTGTACCGTGGAGGTCAGTCGTGACATCAGGTCCGTTCCTCTCCATCCTGGTGTCCCACACGTGTTCCAACTGGGGCTGGTACATGCTGCTCATTGAACTGCCGTTTTATATGAAGCAGATATTGATCAAGTATTATCTGTCACAGAACGCTGTAACCACAGCTCTGCCGTTTCTCTCGCTGTGGTTCTTCAGTATGGCGCTGAGCAGGACATTGGACTGGTTGCGGGCTAAAGGCAGTATTACAACAACCACTGCTAGGAAGATAGGGACTTTGTTTGCATCAGCGGTGCCAGCTGTATGTTTGTTCTGTCTCTGTTTCGTTGGTTGTAACCGGTCCTTGGCGGTGGCACTCACAGCGGTCGGCGTCACCTCAATCGGTGGAATGTTCTGTGGATTTCTCTCCAACCATATCGACATCGCCCCTAACTTCGCCGGTACGCTAATGGCAATAACGAACACGGTCGCAACGATCCCCGGTATAGTCGTGCCAGTTTTCGTCGGTGTTTTGACACATGGGAACGTAAGTGACACCATGATGAACAAAAAACCCAAGTTTAATCTTATAAGTTATATTTGA

Protein sequence:

>DPOGS204659-PA
MDVPAKGNILGRFVPARYILAILGSLGMAIVYGLKVNLSVAMVGMLNHTAIKSMEHHNTEFNSTVSDVECLPAKNDTHGEEADGPFTWSSEVQGIVLSCYFWGYFISQIPGGRIAELFSAKWVMFFSVAINVVCTLLTPVMAELHYLAAVVMRVGEGIGGGVTFPAMHVLLSRWAPPAERSLLSALVYAGTSLGTVASMLLAGLLTATAGWESVFYVMGGLSVLWCGLWVTLVADDPRTQRLISLEEREMIVNSLGRKTASAERKKLPVPWRSVVTSGPFLSILVSHTCSNWGWYMLLIELPFYMKQILIKYYLSQNAVTTALPFLSLWFFSMALSRTLDWLRAKGSITTTTARKIGTLFASAVPAVCLFCLCFVGCNRSLAVALTAVGVTSIGGMFCGFLSNHIDIAPNFAGTLMAITNTVATIPGIVVPVFVGVLTHGNVSDTMMNKKPKFNLISYI-