Monarch geneset OGS2.0

DPOGS214408
TranscriptDPOGS214408-TA1950 bp
ProteinDPOGS214408-PA649 aa
Genomic positionDPSCF300069 + 92147-104998
RNAseq coverage211x (Rank: top 46%)
Annotation
HeliconiusHMEL0128783e-17554.24% 
BombyxBGIBMGA011225-TA0.066.99% 
DrosophilaCG42269-PE1e-15150.42% 
EBI UniRef50UniRef50_E2AZ433e-17248.79%Ectonucleotide pyrophosphatase/phosphodiesterase family member 4 n=15 Tax=Pancrustacea RepID=E2AZ43_CAMFO
NCBI RefSeqXP_967323.20.055.82%PREDICTED: similar to AGAP006609-PA [Tribolium castaneum]
NCBI nr blastpgi|1892353650.055.82%PREDICTED: similar to AGAP006609-PA [Tribolium castaneum]
NCBI nr blastxgi|1892353650.053.97%PREDICTED: similar to AGAP006609-PA [Tribolium castaneum]
Group
Gene OntologyGO:00550851.2e-22transmembrane transport
GO:00160211.2e-22integral to membrane
GO:00228571.2e-22transmembrane transporter activity
KEGG pathway 
InterPro domain[48-414] IPR0161962.7e-32Major facilitator superfamily domain, general substrate transporter
[197-409] IPR0058281.2e-22General substrate transporter
Orthology groupMCL10425 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214408-TA
ATGGCGCCACAAGAACGTACAACCTGCTGCGAGGATTCCCGCCAGGAAGACAATTTAGCGAACATAGATCTTAACAACGGGTTCATTAATAAAGCCTTCACAAACGACGTGAGCGATAATGTCAATACAAAAAATCATACGGAGGAGCCCAAACAAACAAAAAAGGTCAAAGATTTCGACGACCTGTTGCCATACATCGGAGAATTTGGCTGGTATCAGAAAATTCTGTTCCTATTAATGATACCATTCGCATTTTTTGTGTCTATAGTGTATTTTTCGCAAATATTCATGACCATTGTACCCGAACAACATTGGTGCTGGATACCTGAGCTGGCGAATTTGACGGCCGAGGAAAGACGAGCTTTAGCCATCCCAGCGAAATCAGATGGACCCTTCACCCACGACAGATGCAATATGTACTCAGCGGACTTTTCACTGGCTCTGGAACAAGGTAGAACCACTCCTGATGATGAGTGGAAGATCATACCTTGCTCTCACGGCTGGGAGTACAATAGGAGCGATGTGCCTTATGAAACAATCGCGTCACAGCTCGACTGGGTCTGTGATAAGGACAACTACGCAGCGACCGCCCAGGCTATATTCTTCTGTGGTGCGATTGTCGGCGGTTTAGTGTTCGGTTGGATCGCTGACAAGTATGGGAGAATCCCAGCCTTAGTAGGTACAAACCTCGTTGGGTTCGCTGCCGGTGTTGCCACTGCCTTCTGCAACACCTTCTGGACCTTCACACTTTGCCGATTCCTCGTGGGTTTGGCGTTTGACAACTGCTTCACCATGATGTATATCTTAGTTTTAGAATACGTAGGACCAAAATGGCGTACATTTGTTGCCAACATGTCAATAGCTATTTTCTTCACCCTGGGCGCGAGTCTCCTGCCCTGGGTTGCTCTGTGGGCGGGTGACTGGCGTCGGTACGCTCTTTTCATTAGCGCGCCCTTCATAATAGCCGCTGCCACACCCTGGGTTGTACCGGAAAGCGCCAGATGGTTACTGTCGCAGGGAAAAATTGACAAGGCTCTGGTTATAATGAAAAAGTTCGAAAGAGTTAACAAAACTAAGATTCCTGACAAAGTTTTAAATGAATTCACTGAAGCAACCCAACAAGCAATCAAAGATTCTGAAGCCCATAAGTCGTATTCAGTTCTGGACATCTTCAAGACACCGAGGTTACGGAAGAACGCTTTATTATTGATAGTCATCTGGATGGCAATATCCCTGGTCTTCGACGGGCACGTACGGAACGTCGGTTCCCTGGGACTCGACATTTTCCTGACCTTCACCATCGCTACTGCAACCGAGTTCCCGGCGGACACCTTTCTAACCGTTGTACTCGATAGGTGGTCCGTCAGCATCCCTGGCGATCCTGGGACGGTTCGCGGTCAACATCTCCTACAACATAGGGCTGCAGCCAACATATCTCCTGACCTACCCCTCTTAATCCTGGGTGTTTTGGGCATCGTGGGCGGAGGGCTCTGCCTGCTGCTACCAGAGACCATGGACACGGAGATGCCACAGACCCTCGAGGATGGAGAGAACTTTGGTATAGACCAGAAGTTCTGGGACAACCCTTGCTTCCCGAGAAAAACTAAAGAATTTCCAGAGAAATACATGCCCTCGCCTGTAAGAGAAGACAATTTCATCAGATCCTTACCAGCATCCGGCTCTAGGGCATCTGTGCGAGCGTCCATCAGACTATCCAGCAGATCCAGAAGACATCAGGACAACGGAGCTGTTCAAACTGATGATTTGAGGAGGAAGTCCTTGGAGATATTAGGGCTATCCGACCCCGGGGGTCAAGACCTGGAGGTCACATTCGAGAGGAACCTGGACCTGTCAGAGTCCGACAGTTTCCAGACGCCCACAGACCTGCTAGATGTGAACGAAGCGAACCAAATAGAAATCATGATCAGCGATATGAACGCGGATACCTAG

Protein sequence:

>DPOGS214408-PA
MAPQERTTCCEDSRQEDNLANIDLNNGFINKAFTNDVSDNVNTKNHTEEPKQTKKVKDFDDLLPYIGEFGWYQKILFLLMIPFAFFVSIVYFSQIFMTIVPEQHWCWIPELANLTAEERRALAIPAKSDGPFTHDRCNMYSADFSLALEQGRTTPDDEWKIIPCSHGWEYNRSDVPYETIASQLDWVCDKDNYAATAQAIFFCGAIVGGLVFGWIADKYGRIPALVGTNLVGFAAGVATAFCNTFWTFTLCRFLVGLAFDNCFTMMYILVLEYVGPKWRTFVANMSIAIFFTLGASLLPWVALWAGDWRRYALFISAPFIIAAATPWVVPESARWLLSQGKIDKALVIMKKFERVNKTKIPDKVLNEFTEATQQAIKDSEAHKSYSVLDIFKTPRLRKNALLLIVIWMAISLVFDGHVRNVGSLGLDIFLTFTIATATEFPADTFLTVVLDRWSVSIPGDPGTVRGQHLLQHRAAANISPDLPLLILGVLGIVGGGLCLLLPETMDTEMPQTLEDGENFGIDQKFWDNPCFPRKTKEFPEKYMPSPVREDNFIRSLPASGSRASVRASIRLSSRSRRHQDNGAVQTDDLRRKSLEILGLSDPGGQDLEVTFERNLDLSESDSFQTPTDLLDVNEANQIEIMISDMNADT-