Monarch geneset OGS2.0

DPOGS203228
Transcript: DPOGS203228-TA  1500 bp
Protein: DPOGS203228-PA  499 aa
Genomic position: DPSCF300035 + 1280720-1287528
RNAseq coverage: 418x (Rank: top 29%)
Annotation
Heliconius: HMEL003245  2e-153  83.75%
Bombyx: BGIBMGA009191-TA  0.0  78.07%
Drosophila: CG8596-PA  2e-143  54.18%
EBI UniRef50: UniRef50_B0XDF4  4e-142  53.16%  Putative uncharacterized protein n=4 Tax=Endopterygota RepID=B0XDF4_CULQU
NCBI RefSeq: XP_624526.2  2e-160  57.49%  PREDICTED: similar to CG8596-PA [Apis mellifera]
NCBI nr blastp: gi|110760114  3e-159  57.49%  PREDICTED: major facilitator superfamily domain-containing protein 8-like [Apis mellifera]
NCBI nr blastx: gi|110760114  8e-155  59.04%  PREDICTED: major facilitator superfamily domain-containing protein 8-like [Apis mellifera]
Group
Gene Ontology: GO:0055085  8.1e-34  transmembrane transport
    GO:0016021  8.1e-34  integral to membrane
KEGG pathway: ame:552144  5e-160
    K12307 (MFSD8, CLN7)  maps -> Lysosome
InterPro domain: [19-483]  IPR016196  1.2e-42  Major facilitator superfamily domain, general substrate transporter
    [31-348]  IPR011701  8.1e-34  Major facilitator superfamily
Orthology group: MCL13867  Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203228-TA
ATGGGTTTGGTGGGTCGAAATCGTACAGAGGGCGATGCGCTGGAGTCAGCTCAGGAACGTCGCGAGCGCTGGAGGAGTGTTTATATCATATATTTCACTATGTTCCAAATGTCACTTGGTTTCAGTATCGTGCTTACTGGCGTGTGGCCTTACCTTGATAAGTTGCAGCCAGGGTCTAGCAAAGAAGCTTTAGGGTTGGCAGTTGGAGCCAGTCCGCTTGGACAACTCGCAGCCTCGCCTTTACTCGGTTTTTGGGCAAACCGTGCAGGAAGTGCTCGTGGCCCAATGTTAACAACCCTGGCATTGTTCGTCATGGCATCAACTTTGTACGCACATTTGCACTTGACGCGCCCATATGCTCATCATTGGATGCTGGCAGCAAGAGCTCTTGTTGGAGTTAGTTCGGCGAATGTGGCTGTAGCAAGGTCATATTTGTCGGCTGCAACCCGTGAAAGCGAAAGAACGAGAGCTGTCGCTGGAGCATCATTGGCCCAAGTCCTAGGTTTTGTAGTTGGCCCGGCTCTACAGGCTGCTGTTGCCCCTCTCGGGCCCGGCGAACCTTACCCTCCACTCGGACAATACAATCATCCAATAAGGTTGGATATGTACACAGCTGCCGGTTGGATTAACGCAGTACTAGGACTCATAAATTTTATTCTCTTTCTTCCTTTCTTTTTTAAAGAAAAGAAGATAGCTGCCAGAGAAGCTATGTTGGCTCATGGCAAGGAAACGGAAAAAGAAGCAATGAAGGCTATTAAACCGGACTATGTCAGTAGTTGGATGCTTGTAGGTGCTTTCTTCGTTTTGGTGTTTAACTTCGTTCTTCTTGAGACTTTGGCAACTTCCCTAACCATGGATCAGTTTGCATGGAATAAGAAGCAAGCATTAGAATACATGGGAGCACTCATGAGTGCTGGCGCAATAGTGGCTTGTGTTGTGTTCGCGCTCATTACGCCTCTTACTAAGTTATTTGAGGAAAGAGCATTGTTACTCTGGGGAGGTTTTCTGTTAACCGGCATGGCATCAATACTCTGTATTCCCTGGGGTCCGGGACCTCCACCATTGGCGGGTAGCTCCGGTGTAACGGAGGAGGCAGGTGGCGGCTGCCCTCAGCACAGCCAGCCCTGGTGCGAGAATTCACGCGGTTTAACTATCGTGCAATTCTTATTGGGATATACCTGTGTGTCTATTGGTTATCCGCTAGGAGTTACACTTATACAAACTATATTTTCTAAGGTTTTAGGTCCTCGACCTCAAGGCGTCTGGATGGGAGTCCTTACAGGAGCCGGATGCGTCTCCCGTGCACTTGGTCCAGTGTTCGTGTCAGCAGTTTACGCCAGACATGGACCCGACGCTACCTTCGCATCGACTGCGGCTTTAACCTTCGTAGCGTTATTAGCTTTGAGGCTTGTATACTCCAGGCTGAAGCCACCTCCCAGCCCAGAGGTTGTACCGCCAAGAGAGATGATGCCGTTAAAACAGAATGAAAACGTATCGTGA

Protein sequence:

>DPOGS203228-PA
MGLVGRNRTEGDALESAQERRERWRSVYIIYFTMFQMSLGFSIVLTGVWPYLDKLQPGSSKEALGLAVGASPLGQLAASPLLGFWANRAGSARGPMLTTLALFVMASTLYAHLHLTRPYAHHWMLAARALVGVSSANVAVARSYLSAATRESERTRAVAGASLAQVLGFVVGPALQAAVAPLGPGEPYPPLGQYNHPIRLDMYTAAGWINAVLGLINFILFLPFFFKEKKIAAREAMLAHGKETEKEAMKAIKPDYVSSWMLVGAFFVLVFNFVLLETLATSLTMDQFAWNKKQALEYMGALMSAGAIVACVVFALITPLTKLFEERALLLWGGFLLTGMASILCIPWGPGPPPLAGSSGVTEEAGGGCPQHSQPWCENSRGLTIVQFLLGYTCVSIGYPLGVTLIQTIFSKVLGPRPQGVWMGVLTGAGCVSRALGPVFVSAVYARHGPDATFASTAALTFVALLALRLVYSRLKPPPSPEVVPPREMMPLKQNENVS-