Monarch geneset OGS2.0

DPOGS201200
Transcript: DPOGS201200-TA (1314 bp)
Protein: DPOGS201200-PA (437 aa)
Genomic position: DPSCF300262 + 530060-533881
RNAseq coverage: 49x (Rank: top 70%)
Annotation
Heliconius: HMEL015902, 5e-139, 61.75%
Bombyx: BGIBMGA014244-TA, 2e-81, 35.84%
Drosophila: CG15096-PA, 2e-75, 34.52%
EBI UniRef50: UniRef50_E2BGX2, 1e-76, 34.72%, Putative inorganic phosphate cotransporter n=13 Tax=Formicidae RepID=E2BGX2_HARSA
NCBI RefSeq: XP_624251.2, 4e-78, 36.38%, PREDICTED: similar to CG15094-PA, isoform A [Apis mellifera]
NCBI nr blastp: gi|380023205, 7e-77, 36.26%, PREDICTED: putative inorganic phosphate cotransporter-like [Apis florea]
NCBI nr blastx: gi|380023205, 1e-77, 36.19%, PREDICTED: putative inorganic phosphate cotransporter-like [Apis florea]
Group
Gene Ontology: GO:0055085, 8.7e-51, transmembrane transport
    GO:0016021, 8.7e-51, integral to membrane
KEGG pathway:
InterPro domain: [8-400] IPR016196, 3.5e-68, Major facilitator superfamily domain, general substrate transporter
    [17-362] IPR011701, 8.7e-51, Major facilitator superfamily
Orthology group: MCL34388, Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201200-TA
ATGGGCGTCGCTGTTCTGGCGATGACGGATGGCAGCCGCGACGATATTGAAACGTATAAATGGGACAAGAAGACCCAAGGGATCATTCTGTCCTCCTTCTTTTGGGGCTACACGGTGATGCAGATACCTGCCGGTATACTGTCTAAACGTTATGGGGGGAAGTCGATTCTTCTGATATCGCTTCTAACTAACACTGTTATATGTGGCCTATTACCAACTCTTGTAAAAATCTGGGGTTGGCCAATAGTTTGCGTATGCCGCGTCATTATGGGGTTAACTCAAGCTTGCCTTTTCCCGGCTACTCATACCTTACTAGGTCGGTGGTTACCAAAGCAGGAGAGGACCTCCTATAGTGGAATTGTATACGGAGGAACTCAAATCGGAATCATAATTGCGATGCCAATATCTGGATTATTAGCTGATACCAAGTACGGATGGAAATCTATATTTTACACAGTGTCAGCTACAATGCTTGTTACTGCTGGTGTATGGTCTTTCTTCGCAGCCAATATGCCGAGAGAACATCGAATGATGACGGAACAGGAAAGGCATTATATTGAAAGAGGTTTAAATACGGTAGAAGCTAAGGCACTTCGTACTCCTTGGCGCCATATATTCCAAACAAGAGCGTTATGGGCTATCATGGTAACTCACATCGGGAGTTCAGTCAGTTTTGTCCTGTTCTTTGTAGATTTACCCACGTACATAGAACGCGGCCTGAAAATAAGCTTAAAAAATAGTGCTGTTCTGTCAGCGTTACCGTACTTAGGAATGTGGATTGGTAACATTTCCTCGACGTATATTTGCGAGAAAATTTACAATCGGAACTATTGGTCCTTGTTAACTTGCAGAAAAGTCTTTAATTCAATTTCTTTCTTTGGATTATCTATTGGCTTAATCGTCTTAGCTTTCTTAGGGCCTGAAAACAAATCCTTGGCCATCATTACTTTAGTCATGTCATTATTCTTAAGTGGGTTTTCTGCTGCTGGTTTTTTGATGTCTTTTTTGGATCTTTCGCCGAACTATGCTGGTGTAACGCTTTCTCTTTCAAATGCAGCTGCGAACTTTGGTAGTATTTTAACACCAATTGGGACAAGCTTAGTACTAAAAAATGACCCTACTGACACCAGTAGATGGCTAATAGTATTCCTGAGCACGGCGCTGTTGTGTGTCATCGCTAACTGTGTGTTCTTGATGTTCGCTAGCTCCACGCAGGTGGAATGGGATGATCCGAACTTTATTGAGAAGACAATGGCTGATAAAGAGGAAGTAATACCAGCTTTGAAGACTGTGAAGGAAGATGCAGAACGATAA

Protein sequence:

>DPOGS201200-PA
MGVAVLAMTDGSRDDIETYKWDKKTQGIILSSFFWGYTVMQIPAGILSKRYGGKSILLISLLTNTVICGLLPTLVKIWGWPIVCVCRVIMGLTQACLFPATHTLLGRWLPKQERTSYSGIVYGGTQIGIIIAMPISGLLADTKYGWKSIFYTVSATMLVTAGVWSFFAANMPREHRMMTEQERHYIERGLNTVEAKALRTPWRHIFQTRALWAIMVTHIGSSVSFVLFFVDLPTYIERGLKISLKNSAVLSALPYLGMWIGNISSTYICEKIYNRNYWSLLTCRKVFNSISFFGLSIGLIVLAFLGPENKSLAIITLVMSLFLSGFSAAGFLMSFLDLSPNYAGVTLSLSNAAANFGSILTPIGTSLVLKNDPTDTSRWLIVFLSTALLCVIANCVFLMFASSTQVEWDDPNFIEKTMADKEEVIPALKTVKEDAER-
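
The lengths and coordinates listed in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only and is not part of the original record. It uses the stated transcript length (1314 bp), protein length (437 aa), and genomic coordinates (530060-533881). It assumes the coordinates are 1-based and inclusive, which the record does not state. The function names are hypothetical.

```python
# Values copied from the record above.
TRANSCRIPT_BP = 1314
PROTEIN_AA = 437
GENOMIC_START, GENOMIC_END = 530060, 533881


def coding_length_bp(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting the stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


def genomic_span_bp(start: int, end: int) -> int:
    """Length of a coordinate range, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


expected_bp = coding_length_bp(PROTEIN_AA)
span_bp = genomic_span_bp(GENOMIC_START, GENOMIC_END)

# Does the transcript length equal 3 * (protein length + 1 stop codon)?
print("transcript matches protein + stop:", expected_bp == TRANSCRIPT_BP)
# How much of the genomic span is not covered by the transcript?
print("genomic span (bp):", span_bp)
print("span minus transcript (bp):", span_bp - TRANSCRIPT_BP)
```

Under these assumptions, the transcript length equals exactly three nucleotides per residue plus one stop codon, which suggests the listed transcript is the coding sequence alone. The genomic span is longer than the transcript, and the difference would presumably correspond to introns or other sequence not included in the transcript.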