Monarch geneset OGS2.0

DPOGS203613
TranscriptDPOGS203613-TA1356 bp
ProteinDPOGS203613-PA451 aa
Genomic positionDPSCF300063 + 140856-145014
RNAseq coverage744x (Rank: top 17%)
Annotation
HeliconiusHMEL0173270.072.21% 
BombyxBGIBMGA007281-TA0.079.67% 
Drosophilal(2)01810-PA4e-10641.77% 
EBI UniRef50UniRef50_Q9VKC96e-10441.77%LD14545p n=17 Tax=Neoptera RepID=Q9VKC9_DROME
NCBI RefSeqXP_968382.17e-12244.78%PREDICTED: similar to sodium-dependent phosphate transporter [Tribolium castaneum]
NCBI nr blastpgi|910892311e-12044.78%PREDICTED: similar to sodium-dependent phosphate transporter [Tribolium castaneum]
NCBI nr blastxgi|1839792980.071.52%similar to CG5304-PA [Papilio xuthus]
Group
Gene OntologyGO:00550851.3e-34transmembrane transport
GO:00160211.3e-34integral to membrane
KEGG pathway 
InterPro domain[1-413] IPR0161963.1e-60Major facilitator superfamily domain, general substrate transporter
[27-227] IPR0117011.3e-34Major facilitator superfamily
Orthology groupMCL10166 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203613-TA
ATGATACCGAAATGGAGACGGCAAATAACAAAGTTATTTTTAATACCCCAGCGGTATGTTCTCGGAATCATGGGTCTCTTGGCGGTGTGCAACGCATACACCATGCGAGTTTGTTTAAATTTGGCTGTTACACAGATGGTCAATAATACTAAGAATGAGGAATCGCACTTCGATCCTGATGCCTGTCCTGACGAGAGCGAGATCTTCGTCAATGGAACTGTTACGTCGAAACCACATGCAATATTTGAATGGTCCGAATCGACGCAGGGGTTAATTCTGAGTGGTTTCTATTATGGATATGCTATTACACATGTTCCGGGGGGTTATTTAGCAGAAAGGTACGGTGGAAAATGGACATTAGGAATTGGACTCCTGAGTACAGCTATTTTTACCCTCCTAACACCAATTGTTGTGAAGGCTGGCGGAGCGACATGGTTATTTATACTTCGTGTTCTACAAGGGATGGGAGAGGGTCCCACAATGCCCGCTCTGATGATAGTTCTAGCTAGATGGGTTCCTCCACACGAGCGCTCACGTCAGGGTGCTGTCGTGTTTGGTGGAGCTCAGATCGGTAACATCTTCGGTTCTTTCATGTCTGGTTTCCTAATGGCGGGTGGAGGCGACTGGGCCAATGTATTTTACTTCTTTGGCTGTTTCGGGATTCTGTGGTTTGTTGCTTGGGTGGGACATGATTGGGGATATTACACTATGGTGACAGATTTACCGAAGTACATGACTGATGTCTTAAAGTTCAATATTGCTACAACGGGTACCCTCACAGCGATCCCATATTTGGCTATGTGGATAAGTGCCTTCATATTTGGATGGGTTTGCGACGTATGTGTTCAGAGGAACTGGCATTCCATTAAAACTGGAAGGATAATTCATACAACAATAGCGGCCACTGGACCAGCAATTTGTATAATACTAGCGTCATACGCCGGGTGCGATCGTTTTGCTGCCGTTGCATATTTTATAGCGTCCATGGCTCTTATGGGAGGCTTCTATAGTGGGATGAAGGTGAATGCTCTTGACCTGGCGCCCAATTATGCAGGTTCCTTGACTGCAATGATTAATGGCACTTCAACAATATCTGGCATCATAACACCGTATCTGATAGGCCTTTTAACTCCTGATTCGACATTAAAACAATGGCGAGTGGCTTTCTGGGTATGTTTTGCTGTTCTCGTCGGAACGAATGTTATTTACAATATCTGGGCTGATGGCAAACAGCAGTGGTGGGACGATGTTAGACAATATGGTTACCCCGAAAATTGGAAGCATGGTCCACTCAAGATGGCTAGTAAGGATAGCGAAAAAAATAAGAAGAAAGAAGCTCAAGGAACTGCACTTTAG

Protein sequence:

>DPOGS203613-PA
MIPKWRRQITKLFLIPQRYVLGIMGLLAVCNAYTMRVCLNLAVTQMVNNTKNEESHFDPDACPDESEIFVNGTVTSKPHAIFEWSESTQGLILSGFYYGYAITHVPGGYLAERYGGKWTLGIGLLSTAIFTLLTPIVVKAGGATWLFILRVLQGMGEGPTMPALMIVLARWVPPHERSRQGAVVFGGAQIGNIFGSFMSGFLMAGGGDWANVFYFFGCFGILWFVAWVGHDWGYYTMVTDLPKYMTDVLKFNIATTGTLTAIPYLAMWISAFIFGWVCDVCVQRNWHSIKTGRIIHTTIAATGPAICIILASYAGCDRFAAVAYFIASMALMGGFYSGMKVNALDLAPNYAGSLTAMINGTSTISGIITPYLIGLLTPDSTLKQWRVAFWVCFAVLVGTNVIYNIWADGKQQWWDDVRQYGYPENWKHGPLKMASKDSEKNKKKEAQGTAL-