Monarch geneset OGS2.0

DPOGS204423
TranscriptDPOGS204423-TA1941 bp
ProteinDPOGS204423-PA646 aa
Genomic positionDPSCF300002 - 526728-531282
RNAseq coverage118x (Rank: top 58%)
Annotation
HeliconiusHMEL0062540.074.00% 
BombyxBGIBMGA007724-TA0.075.80% 
DrosophilaCG42575-PA5e-15545.82% 
EBI UniRef50UniRef50_B0W6785e-16247.41%Phosphate transporter n=5 Tax=Coelomata RepID=B0W678_CULQU
NCBI RefSeqXP_001844212.19e-16347.41%phosphate transporter [Culex quinquefasciatus]
NCBI nr blastpgi|1700326882e-16147.41%phosphate transporter [Culex quinquefasciatus]
NCBI nr blastxgi|3479715537e-16448.08%AGAP004251-PA [Anopheles gambiae str. PEST]
Group
Gene OntologyGO:00160206.2e-216membrane
GO:00068176.2e-216phosphate transport
GO:00053156.2e-216inorganic phosphate transmembrane transporter activity
KEGG pathway 
InterPro domain[1-646] IPR0012046.2e-216Phosphate transporter
Orthology groupMCL11437 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204423-TA
ATGGATCCGTACTCGAAAGACTTACTGTGGTTAGTTATATGTGGTTTTATAGTAGCTTTTATTTTGGCGTTTGGTATCGGAGCTAACGACGTTGCAAATTCCTTCGGTACAAGCGTGGGTTCCAAAGTCCTCACACTTACACAAGCATGTATTCTAGCGACCATTTTTGAGATCGCTGGAGCTGTGTTAATTGGCTATAAGGTATCAGATACAATGCGCAAAGGTATACTGGATGTATCTTTGTATGCGGATGGAGGGGAACGTCTGTTGGCTGCTGGTTGTCTTGCAGCGCTGATAGCGAGCGCTGTCTGGTTGATTCTTGCTACAGGATTAAGCTTACCCGTGTCTGGGACTCATTCCGTGGTAGGAGCGACTGTTGGTTTTACGCTAACAGCAAAAGGTCCGATAGGCGTACGATGGTCCACACTCGGGGCAATAGTCTTATCCTGGTTTATATCTCCTGTGTTAAGCGGTGCAGTATCAGCGTTTCTATATTGGCTAGTCCGTAAATTCATTCTACGTTCTCCTCAGCCAATTAAGGCTGGATTGCACTCACTTCCGTTCTTTTATGGAGCAACAATTGCTGTTAATGTTTTGAGTGTTGTGCATGACGGCCCAAAATTGTTAGAAATGGATAACATACCGCTTTGGTTAGCCTTAGCCGGTTCATTGGCACTTGGTGCTATTGGAGCACTATTAGTGCGTGTTTTCCTGGTGCCCTACTACCGTCAACGTCTTGTACCGCCACACGTTAACTTTACTGTAGGACTTTCCAATGAAACAACACCAGCCAATACTCCAACACATAACAAGAACAGCACCGCCCAACGTCCAACTTCTCTGTTATCTGAAGACGGAAAACTCCTTGAAGTAATCACAGAAAGTGCAGAAATGGTTACACTTAGTGACGCTGATAAATCATCAGTCGGTGTTAAAGAAATGAATGCCAAGAATCGAGCTTTGTTAGCCACAATGGATGACTGCAGTATTCTGTCTCGAAGCTTAAGTCCTCCAAATAAATCCCGACTTCAATTAATAGATGCAGACCCGCAAATAAATACACTTAAATATATAGACGAAACCCTTAGTTGTTGCAAGAGTTTGGATTCCAATCAATTGGTAGGAATGGGCGAGAGCTATGATTCAAAGAATGGATTTTTAGGTGATTCATGTGACACAATAGCACGCGGAGAGTTTGGCCGATTATCAGCTTTGATGGCACTGCCAACAACCGTAGAATTCGACACGCCACCACCAAGGCTGGATAAGGATCCAGCGGTGGGCAGCGCTTGGAGCATCGAGTGTGACATACGCAGGTCCCGCTTAGCTGCACTTACTCCCAACTCTAGCGCGGCGCCTCTACTGCGAGCGGCCAGTCCTCCGCGCGCCCCACCGGCTGCACCACCAGACACTTTCCGCCTCTTCTCCTTCCTACAAGTTCTCACAGCCACCTTCGGGTCGTTTGCTCACGGAGGAAATGATGTCAGTAACGCGATAGGACCATTGGTGGCGCTCTGGCTGTTATATTCTGAAGGTGGCGCCCACGCGAAAGCGGAGACACCACTTGCAATACTTGTGTTCGGTGGCGTGGGTATAGCGCTGGGACTTTGGCTGTGGGGACGCCGAGTCATTCAGACGGTAGGAGAGGATCTGACCAGCATAACACCCGACACTAATAGAAACGCTATGAAACAAACAANNNNNNNNNNNACTATCGAGTTGGGAGCGGCTCTGACTGTGCTGGTGGCGTCGAAGGTGGGGCTACCGGTGTCCACCACGCACTGTAAGGTCGGCTCTGTGGTTTGTGTGGGATATTCCTCCGAAAATAAGGTGGACTGGAGCTTGTTTAGGAACATAATATTTGCGTGGGTGGTCACTGTACCGGCGGTGGCGGGAATGTCATCGCTCGCTATGCTTGCTCTTGAAAAATTCGTCGTTTAA

Protein sequence:

>DPOGS204423-PA
MDPYSKDLLWLVICGFIVAFILAFGIGANDVANSFGTSVGSKVLTLTQACILATIFEIAGAVLIGYKVSDTMRKGILDVSLYADGGERLLAAGCLAALIASAVWLILATGLSLPVSGTHSVVGATVGFTLTAKGPIGVRWSTLGAIVLSWFISPVLSGAVSAFLYWLVRKFILRSPQPIKAGLHSLPFFYGATIAVNVLSVVHDGPKLLEMDNIPLWLALAGSLALGAIGALLVRVFLVPYYRQRLVPPHVNFTVGLSNETTPANTPTHNKNSTAQRPTSLLSEDGKLLEVITESAEMVTLSDADKSSVGVKEMNAKNRALLATMDDCSILSRSLSPPNKSRLQLIDADPQINTLKYIDETLSCCKSLDSNQLVGMGESYDSKNGFLGDSCDTIARGEFGRLSALMALPTTVEFDTPPPRLDKDPAVGSAWSIECDIRRSRLAALTPNSSAAPLLRAASPPRAPPAAPPDTFRLFSFLQVLTATFGSFAHGGNDVSNAIGPLVALWLLYSEGGAHAKAETPLAILVFGGVGIALGLWLWGRRVIQTVGEDLTSITPDTNRNAMKQTXXXXTIELGAALTVLVASKVGLPVSTTHCKVGSVVCVGYSSENKVDWSLFRNIIFAWVVTVPAVAGMSSLAMLALEKFVV-