Monarch geneset OGS2.0

DPOGS216069
TranscriptDPOGS216069-TA1662 bp
ProteinDPOGS216069-PA553 aa
Genomic positionDPSCF300067 + 357459-364987
RNAseq coverage756x (Rank: top 17%)
Annotation
HeliconiusHMEL0089343e-14057.04% 
BombyxBGIBMGA008869-TA0.074.52% 
DrosophilaCG9864-PA7e-10946.81% 
EBI UniRef50UniRef50_D6WVB63e-12650.94%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WVB6_TRICA
NCBI RefSeqXP_967997.15e-12750.11%PREDICTED: similar to sodium-dependent phosphate transporter [Tribolium castaneum]
NCBI nr blastpgi|910880711e-12550.11%PREDICTED: similar to sodium-dependent phosphate transporter [Tribolium castaneum]
NCBI nr blastxgi|2700120892e-12653.38%hypothetical protein TcasGA2_TC006192 [Tribolium castaneum]
Group
Gene OntologyGO:00550852.9e-42transmembrane transport
GO:00160212.9e-42integral to membrane
KEGG pathway 
InterPro domain[124-522] IPR0161961.4e-65Major facilitator superfamily domain, general substrate transporter
[121-463] IPR0117012.9e-42Major facilitator superfamily
Orthology groupMCL14888 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS216069-TA
ATGTTCCAAAATATACATGTCATAAATGTAACATCAAAGTCCAAATGTGGCCCCGATAATGTACCGCTAAATGAGTCTTTAAATCATGTTGATGTTTCTGATGTGAGGGAAGCACCGAGACAGGTGGCTGGTGGCACATTTGATTGGACAAAAGATCAACAGGCAACGATTCTCGGTTCATATTTCTGGTGCTATCCTATAACGTCGCTCATTGGTGGTATGGCATCCGAACGATGGGGTCCTAGATACGTGGTCTTAATAACGTCACTGTCCAAATGTGGCCCCGATAATGTACCGCTAAATGAGTCTTTAAATCATGTTGATGTTTCTGATGTGAGGGAAGCACCGAGACAGGTGGCTGGTGGCACATTTGATTGGACAAAAGATCAACAGGCAACGATTCTCGGTTCATATTTTTGGTGCTATCCTATAACGTCGCTCATTGGTGGTATGGCATCCGAACGATGGGGTCCTAGATACGTGGTCTTAATAACATCACTGGTCAGTGCTATATTGACAGCGTTAAGTCCAGCCGCTGCCAGACTTGATTACGTAGCCCTGGTTATAATACGATTCTTCCTTGGATGTGCTGGGGGTTTTATCTACCCTTCGCTTCATTGTTTGGTTGCTCGCTGGGCACCACCCGCCGAAAAAAAATTCGTGAGCGCTATGATGGGTGGAACTTTAGGGACAGTAGTAACCTGGTCACTCACTGGTCCTTTGTTAGAAAGATTCGGATGGGCTTCGGCGTTTTACGTACCAGCGGGACTAACATTTATTTGGTGTGGATTTTGGTGGTACCTTGTGGCAGACACGCCTTCAGAACACCCTAGAATTTCGGCGTCTGAAAGAAAATACATATTAGATGCCTTGGGAGATAAAGTTAAGAAATCAAAGGGTTTGCCGCCATTCAGAAGAATAATTACCTCATTCCCGTTCTTAGCGATGGTTATTCTACACTTTGGAAATCTGTGGGGATTATACTTTATCATGACGGTGGGACCAAAATTTGTATCAAGCGTTCTTGGTTTTGAATTGTCAGCGGCAGGAATAATATCTGCCTTGCCGTATCTTGCAAGATTAATACTTGCAACTATATTTGGGGCTATCGGAGACTGTATTTTATCTAGGAAGCTAATGACAACAACAACTATAAGAAAATTCTTCTGTCTCTTTTCTCACATTATCCCTGGAATCTTATTAGTTTTGCTGGTATATACTGGATGTTCCACAGCATTATCGGTTGCAATGATCACGATGTCAATGGGATTCAATGGCGCTGCAACGTTGACGAATCTCCAAAATCACCAAGACTTGGCTCCTAACTATGCAGGAACCTTGTATGGTATTGCAAATTTCATAGGCAGTACTGCTGGATTTTTCACACCTATGATAACTGCATATTTTACGAAGACCGGGGATAGTTTCGAACAATGGAGGCCAGTATTCTTTGTGGGAGCGTCTGTATATATTGTGTCTGCGATATTTTTCATACTATTTGGAACTGGCAACACTCAGGCTTGGAACTTCGATGATGAATCTAAGCAAGAAGGAAAAGAGGAAAAGGGTCGCGCCGACGATATGAGCGATATGAATGAAACAATAAAAAATAGTTATAAAGACCCGAAAGAAAATTCCATGAGTATTACAACGCGTACTTAA

Protein sequence:

>DPOGS216069-PA
MFQNIHVINVTSKSKCGPDNVPLNESLNHVDVSDVREAPRQVAGGTFDWTKDQQATILGSYFWCYPITSLIGGMASERWGPRYVVLITSLSKCGPDNVPLNESLNHVDVSDVREAPRQVAGGTFDWTKDQQATILGSYFWCYPITSLIGGMASERWGPRYVVLITSLVSAILTALSPAAARLDYVALVIIRFFLGCAGGFIYPSLHCLVARWAPPAEKKFVSAMMGGTLGTVVTWSLTGPLLERFGWASAFYVPAGLTFIWCGFWWYLVADTPSEHPRISASERKYILDALGDKVKKSKGLPPFRRIITSFPFLAMVILHFGNLWGLYFIMTVGPKFVSSVLGFELSAAGIISALPYLARLILATIFGAIGDCILSRKLMTTTTIRKFFCLFSHIIPGILLVLLVYTGCSTALSVAMITMSMGFNGAATLTNLQNHQDLAPNYAGTLYGIANFIGSTAGFFTPMITAYFTKTGDSFEQWRPVFFVGASVYIVSAIFFILFGTGNTQAWNFDDESKQEGKEEKGRADDMSDMNETIKNSYKDPKENSMSITTRT-