Monarch geneset OGS2.0

DPOGS201201
TranscriptDPOGS201201-TA1503 bp
ProteinDPOGS201201-PA500 aa
Genomic positionDPSCF300262 + 535123-540716
RNAseq coverage50x (Rank: top 70%)
Annotation
HeliconiusHMEL0159033e-15659.25% 
BombyxBGIBMGA014244-TA3e-5230.93% 
DrosophilaCG15096-PA3e-4626.30% 
EBI UniRef50UniRef50_E2BGX25e-5128.60%Putative inorganic phosphate cotransporter n=13 Tax=Formicidae RepID=E2BGX2_HARSA
NCBI RefSeqXP_393759.16e-5228.27%PREDICTED: similar to Picot CG8098-PA, isoform A [Apis mellifera]
NCBI nr blastpgi|480973621e-5028.27%PREDICTED: putative inorganic phosphate cotransporter [Apis mellifera]
NCBI nr blastxgi|3454801868e-5329.18%PREDICTED: putative inorganic phosphate cotransporter-like isoform 2 [Nasonia vitripennis]
Group
Gene OntologyGO:00550852.5e-26transmembrane transport
GO:00160212.5e-26integral to membrane
KEGG pathway 
InterPro domain[1-454] IPR0161968.4e-42Major facilitator superfamily domain, general substrate transporter
[36-416] IPR0117012.5e-26Major facilitator superfamily
Orthology groupMCL34389 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201201-TA
ATGGACGTTGAATTTTTAAGACATAAATATGAAGACCGACACGACGGTGTCATCACAAAAGCGAAGTTTAAGCTTGGGGTTCGTCACATTCAAGTGGTCTACATGTTCGTCTGTTCGTTCGCGATGGGGGCGTTAAGGAGCAGCAACAGCATCGCGATCCTTTCTGTAGCGCAACAAAGCAGAAACAATGGTTCATACTTACAGAATCACAATTGGGACAAAAGAATTCAAGGCACTGTTCTCTCATCATTCTTGGTCGGATACGCCTCAGCGATGCTTCCAGCTGAGTTATATCTGAAGGAAGTTGATGATAAATTTATAATGGCAGCCGTGCTTCTCATTAACGGAGGACTGACGGCAGCTATGCCAACTATTGTTAAAAAGGGTGGATGGATCGCTGTTAGCAACGGAGAATTCTTGATGGGTATAACTCAAGCCTGTCTGAGTCCAGTGAATCTTAATCTTATTTCCAAATGGCTTCCACCCAGCGAGAGGTCACTGTGTGGATTTTTAATTCAAGGAGCAATAATTCTAGGCACCATAATTGCATTGCCTGTGTCAGGTATAATTTCGGAATCACGCTTGGGTTGGGAACTCATATTTTATTCTCAGGCAATGATGACGCTGTCGACAGCCACGTTATGGGTGTTACTGACTTCTAGTACCCCAGAAACGCACCACGCTATAGGAGATACTGAAAAAGAGTATATTAGACGCTCCCTGTCATGTTACCGACAGAATAAACTCCATAAACCTTGGCGAAATATCCTTCAAACAAAACAATTTTGGGCAATTGCATCGGCTCATACCGCAGCAAACGTGCTCTACGTTTTCTTTTTAGTCGAATTATCCTCGTTTCTCGTTTCAATGGATTTATCTATTAAGAACTCGGGTACACAAACAGCTCTACCTTTTGTGGGGATGTGGGTGGTTTATCTGCTGACTTCGCCTACAATAGAATTAATTTATGGTATCGGAAATGTTAACTACTTGTTCGACGTCAAATATTTCAGGAAAATAGTTAATGGATTCGGATCCCTTGGGATAGTAATCGGATTACTAACTCTACCCTATTTGGTTCCGTCGTGGAATCATCTAGGGCTCATAACATTGGTCGGAACTTTTTCATTACTTGGGATTCAATATTCCGGTTTTCTTGAAAATCACAAAGATATGACACAGAATTACTCGAACACTTTACTTGTGCTGAGCAACATAGTCTCTAGTGCTGTAGCAGCTCTCGTTCCTGTGCTCACTGCAGCTATCGTCAACGTTGATGAGGGTGATTTAAATCGCTGGAAAATTATATTTCGCTTGCTCGCTGGTTTCTATGTTATTTGCAATGTCGTCTACACTTTATGCGCAAACAGCGACAGGCAAGAATGGGACAGAAGCGCTAAAACAAAGTTCGGATATTGCAACGCCTTAACTAACTTGGAATTGGACGAGATCAATCAAACTACTAGACTGGACTGTCAGAGAGAAGATGATACATCAGTGTAA

Protein sequence:

>DPOGS201201-PA
MDVEFLRHKYEDRHDGVITKAKFKLGVRHIQVVYMFVCSFAMGALRSSNSIAILSVAQQSRNNGSYLQNHNWDKRIQGTVLSSFLVGYASAMLPAELYLKEVDDKFIMAAVLLINGGLTAAMPTIVKKGGWIAVSNGEFLMGITQACLSPVNLNLISKWLPPSERSLCGFLIQGAIILGTIIALPVSGIISESRLGWELIFYSQAMMTLSTATLWVLLTSSTPETHHAIGDTEKEYIRRSLSCYRQNKLHKPWRNILQTKQFWAIASAHTAANVLYVFFLVELSSFLVSMDLSIKNSGTQTALPFVGMWVVYLLTSPTIELIYGIGNVNYLFDVKYFRKIVNGFGSLGIVIGLLTLPYLVPSWNHLGLITLVGTFSLLGIQYSGFLENHKDMTQNYSNTLLVLSNIVSSAVAALVPVLTAAIVNVDEGDLNRWKIIFRLLAGFYVICNVVYTLCANSDRQEWDRSAKTKFGYCNALTNLELDEINQTTRLDCQREDDTSV-