Monarch geneset OGS2.0

DPOGS214409
Transcript: DPOGS214409-TA; 1761 bp
Protein: DPOGS214409-PA; 586 aa
Genomic position: DPSCF300069 + 109628-115921
RNAseq coverage: 136x (Rank: top 56%)

Annotation
Heliconius: HMEL012878; 0.0; 76.30%
Bombyx: BGIBMGA011226-TA; 0.0; 64.46%
Drosophila: CG42269-PE; 0.0; 59.05%
EBI UniRef50: UniRef50_E2AZ43; 0.0; 60.92%; Ectonucleotide pyrophosphatase/phosphodiesterase family member 4 n=15 Tax=Pancrustacea RepID=E2AZ43_CAMFO
NCBI RefSeq: XP_967323.2; 0.0; 62.48%; PREDICTED: similar to AGAP006609-PA [Tribolium castaneum]
NCBI nr blastp: gi|189235365; 0.0; 62.48%; PREDICTED: similar to AGAP006609-PA [Tribolium castaneum]
NCBI nr blastx: gi|189235365; 0.0; 62.93%; PREDICTED: similar to AGAP006609-PA [Tribolium castaneum]
Group:
Gene Ontology:
  GO:0055085; 2.9e-28; transmembrane transport
  GO:0016021; 2.9e-28; integral to membrane
  GO:0022857; 2.9e-28; transmembrane transporter activity
KEGG pathway:
InterPro domain:
  [24-554] IPR016196; 3.2e-53; Major facilitator superfamily domain, general substrate transporter
  [177-551] IPR005828; 2.9e-28; General substrate transporter
Orthology group: MCL10425; Insect specific

Nucleotide sequence:

>DPOGS214409-TA
ATGGAATCACATGAGAAGGTGGTCGAGGTCCTGGGCGGAAGCCGCGACCTTGACCATGGACAAATAGAAATGCAAAACACCAGGGTAAACGGTTCAAGACCTTCAAAGCCACAGCTCAAAGATCTTGACGACTTGTTCCCGTATATCGGAGAATTTGGCTGGTACCAACGGATTCTGTTCCTCCTCATGATACCCTACTCTTTCTTCTTCGCGTTCGTCTACTTCTCTCAGATATTCATGACCATTGTGCCAGAGGAACACTGGTGCTTTGTCCCTGAACTGGCCAACCTGACTGTGGAAGAGAGGCGCTATCTCTCGATCCCAATGAAAGATGATAGTAAATATGAAAAATGCCACGTATACGACGTGGATTGGTCGAAGGTGTTAGACACGGGGGTTCTAACACCGGATTCTAATTGGCCAGTGAAGAAATGTGAGCAACAATGGGAATATAACTATACTGATGTCCCCTATGAGACGATCGCAAGTCAGCTGGATTGGGTGTGTGACAGAGACAACTATGCCGCAACTGCCCAATCAGTTTTCTTTTGTGGGTCCATCATCGGAGGGCTAATATTTGGATGGGTAGCTGATAAATTTGGGCGAGTTCCAGCTATACTGGGTACGAACATGGCTGGTTTCTTGGCGGGGGTCGGAACTGCGTTTGCGAACAGCTTCTGGTCTTTCTGCCTCTGCAGATTCCTCGTGGGCCTGGCCTATGACAATTGTTTTATGGTCATGTACATAGTCGTGGTCGAATACGTTGGTCCAAAATGGCGTACCTTCGTTGCCAACATGTCTATAGCAGTTTACTTTACATTCGCTGCATGTCTTCTACCGTGGATAGCGCTGGCCGTGGCTGACTGGAAGATGTACACGCTGATAACCAGCGTGCCATTGGTGCTGGCTATTTTCACGCCATGCGTAGTTCCTGAAAGCGCTAGATGGCTTATATCCCAGGGACGTATAGACGAGGCTCTTAAAATATTGAAGAAATTCGAAAGAATAAACAAGAAGAAAATACCTGATGAAATATTTAAAGAATTTAAGGACACCGCCACAAAAATAGCAAAAGCAGAAGAAGAGATCAGGAATTATTCTTTTCTGGATCTCTTCAAGACCCCTCGCCTGCGGCGTCACAGTCTTCTTTTGTTGGTCATCTGGATGTCGATAGCCATGGTCTTCGATGGTCACGTCAGGAATGTCGGTTCTCTGGGCTTGGATATCTTCCTGACATTCACAGTCGCTACCGCCACGGAGTTTCCCGCTGATGTTCTACTGACTTTGACCCTGGATATTGTCGGTCGTAGATGGCTCGCTTTCGGATCCATGTTGTTAAGTGGTATATTCAGTTTCCTCGCGACAACGGTGCCTCATGGTGTACCGTCAGCAACATTGGCCATAATCGGTCGCTTTGCTGTGAACATATCCTTCAACATCGGCATGCAGTACGCGGCGGAGTTATTACCAACTGTGGTGAGGGCCCAGGGTCTGGCACTGATACACATCACGGGTTACGTGGCTACAATACTTGTGCCATACATCGTGTATTTGGCAACAATATCTCCTATCATTCCTCTTCTAATCCTTGGAACCATCGGAATCTTCGGTGGCTGTCTCTGCCTCTGTCTCCCAGAGTCCCTGGGCAAGGATATGCCGCAGACTTTGCAGGACGGAGAAAATTATGGAAAGGAGCAGAAATTCTGGGACTTCCCCTGTTTTAAGAGGAAAAACCAGGTTCCAGTGGAAAACAGATACTGA

Protein sequence:

>DPOGS214409-PA
MESHEKVVEVLGGSRDLDHGQIEMQNTRVNGSRPSKPQLKDLDDLFPYIGEFGWYQRILFLLMIPYSFFFAFVYFSQIFMTIVPEEHWCFVPELANLTVEERRYLSIPMKDDSKYEKCHVYDVDWSKVLDTGVLTPDSNWPVKKCEQQWEYNYTDVPYETIASQLDWVCDRDNYAATAQSVFFCGSIIGGLIFGWVADKFGRVPAILGTNMAGFLAGVGTAFANSFWSFCLCRFLVGLAYDNCFMVMYIVVVEYVGPKWRTFVANMSIAVYFTFAACLLPWIALAVADWKMYTLITSVPLVLAIFTPCVVPESARWLISQGRIDEALKILKKFERINKKKIPDEIFKEFKDTATKIAKAEEEIRNYSFLDLFKTPRLRRHSLLLLVIWMSIAMVFDGHVRNVGSLGLDIFLTFTVATATEFPADVLLTLTLDIVGRRWLAFGSMLLSGIFSFLATTVPHGVPSATLAIIGRFAVNISFNIGMQYAAELLPTVVRAQGLALIHITGYVATILVPYIVYLATISPIIPLLILGTIGIFGGCLCLCLPESLGKDMPQTLQDGENYGKEQKFWDFPCFKRKNQVPVENRY-
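
The transcript and protein entries above can be cross-checked against each other. The following is a minimal sketch, assuming the standard genetic code applies and that the trailing "-" in the protein sequence marks the stop codon; the function name `translate` and the `stop_symbol` parameter are illustrative and not part of the record. For brevity it translates only the first and last codons of DPOGS214409-TA rather than the full sequence.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (an assumption:
# the record does not state which translation table was used).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as stop_symbol, matching the "-" that ends
    the protein sequence in this record.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        protein.append(stop_symbol if aa == "*" else aa)
    return "".join(protein)


# First 30 nt and last 15 nt of DPOGS214409-TA, copied from the record.
print(translate("ATGGAATCACATGAGAAGGTGGTCGAGGTC"))  # MESHEKVVEV
print(translate("GAAAACAGATACTGA"))                  # ENRY-

# 1761 nt is 587 codons; dropping the stop codon leaves 586 residues.
print(1761 // 3 - 1)                                 # 586
```

The translated ends match the start (MESHEKVVEV) and end (ENRY-) of DPOGS214409-PA, and the codon count agrees with the listed lengths of 1761 bp and 586 aa.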