Monarch geneset OGS2.0

DPOGS203614
TranscriptDPOGS203614-TA1599 bp
ProteinDPOGS203614-PA532 aa
Genomic positionDPSCF300063 + 159143-168316
RNAseq coverage633x (Rank: top 20%)
Annotation
HeliconiusHMEL0173270.077.22% 
BombyxBGIBMGA007281-TA0.077.80% 
Drosophilal(2)01810-PA2e-13850.65% 
EBI UniRef50UniRef50_B2DBK30.089.37%Similar to CG5304-PA n=3 Tax=Papilionoidea RepID=B2DBK3_9NEOP
NCBI RefSeqXP_001809051.11e-14152.39%PREDICTED: similar to sodium-dependent phosphate transporter [Tribolium castaneum]
NCBI nr blastpgi|1839792980.089.37%similar to CG5304-PA [Papilio xuthus]
NCBI nr blastxgi|1839792980.089.71%similar to CG5304-PA [Papilio xuthus]
Group
Gene OntologyGO:00550856.5e-54transmembrane transport
GO:00160216.5e-54integral to membrane
KEGG pathway 
InterPro domain[1-483] IPR0161963.9e-80Major facilitator superfamily domain, general substrate transporter
[49-447] IPR0117016.5e-54Major facilitator superfamily
Orthology groupMCL10166 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203614-TA
ATGACTGACCTAAATCAAATACAAGCCCACAACTTACTTGGCTTAAAGAACCAGCCAAGAAAACCGAAGGACACAGCTTTTACGAGAGCTTTGCGATCATGTTGTGTCATACCCCAAAGGTACATTCTAGGAGTTATGGGACTTTTGGGCGTGTGCAATGCGTATACAATGAGAGTCTGTTTGAACTTGGCAATTACACAAATGGTCAATAAGACCAAAAGTGGCACAGAACATTTTGATCCCGACGCATGTCCAAGTGATATTGAAGATTCTAATTCCACAAATATTCTACGACCATACGCGACCTTTGATTGGGATGAAAAAACTCAAGGTTTAATTCTAAGTGGATTTTACTACGGGTATGCAGCGACTCAAGTACCTGGTGGATATTTAGCTGAGAAGTTTGGAGGAAAATGGACATTAGGAATTGGTTTACTTAGTACCGCTTTATTTACTTTTCTAACACCAATAGTTATCAGAGCTGGAGGGGCGACATGGCTCTTTATACTGCGGGTCTTGCAAGGAATGGGGGAAGGTCCGACGATGCCAGCTTTAATGATAATGTTAGCGAGATGGGTGCCACCGCATGAACGTTCGTTTCAAGGGGCCTTAGTATTTGGCGGTGCACAAATAGGAAATATATTTGGCTCTTTCATGTCTGGTATTTTGTTAGCTGATGGAAGAGATTGGGCATATGTATTCTATTTCTTCGGTGGCTTCGGCCTTGTGTGGTTTACTTTGTGGAGTTTGCTTTGCTATAGCACACCAAATACTCATCCTTACATATCAAAGAAAGAACTTAACTATCTCAACAAGAATGTTACAACTGCGGAGAGTATTACAGCAAAGGATCCAGTGCCTTGGAAGGCGATCCTGAGATCTGCTCCTGTATGGGCCCTTGTATGGGCTGCTGTCGGACACGATTGGGGTTATTACACTATGGTGACAGACTTGCCGAAATACTCACACGATGTGCTTAAATTTAACATTGCGACGACTGGAACTCTGACTGCCTTACCTTATATAGCTATGTGGTTATGTTCCTTTCTGTTTGGATTTGTGTGCGACCTCTGCATCAAGAAAGGGTGGCATACTATTAAGACGGGTAGAATTATTCACACTACCATAGCGGCCACTGGACCTGCAATATGTATTATCTTGGCTTCTTACGCTGGATGTGACAGAACTGCTGCTATGGTGTACTTCATCTTGTCTATGGCTCTTATGGGAGGTTTTTACAGTGGTATGAAGGTAAACGCATTGGATCTGGCACCGAATTATGCAGGTTCGCTGACATCGCTAGTAAACACAACTTCTACATTCGCTGGTATTGTGACACCATACCTTATTGGGTTATTGACACCTGATTCAACATTAGCCCAATGGCGGATAGCGTTCTGGGTGTGTTTTGCTGTGTTAGTTGGTACAAACGTAGTGTACTGCATTTGGGCTGACGGTGAACAGCAGTGGTGGGATGATGTAAGGAAACTCGGTTACCCAGCGGATTGGAAGCACGGATCCTTAATACCTGATGGAAATCCCGAACAACCAGAGACTGTGAGATTATCGAGCAACAAAACATCGGATGATGTATATTAG

Protein sequence:

>DPOGS203614-PA
MTDLNQIQAHNLLGLKNQPRKPKDTAFTRALRSCCVIPQRYILGVMGLLGVCNAYTMRVCLNLAITQMVNKTKSGTEHFDPDACPSDIEDSNSTNILRPYATFDWDEKTQGLILSGFYYGYAATQVPGGYLAEKFGGKWTLGIGLLSTALFTFLTPIVIRAGGATWLFILRVLQGMGEGPTMPALMIMLARWVPPHERSFQGALVFGGAQIGNIFGSFMSGILLADGRDWAYVFYFFGGFGLVWFTLWSLLCYSTPNTHPYISKKELNYLNKNVTTAESITAKDPVPWKAILRSAPVWALVWAAVGHDWGYYTMVTDLPKYSHDVLKFNIATTGTLTALPYIAMWLCSFLFGFVCDLCIKKGWHTIKTGRIIHTTIAATGPAICIILASYAGCDRTAAMVYFILSMALMGGFYSGMKVNALDLAPNYAGSLTSLVNTTSTFAGIVTPYLIGLLTPDSTLAQWRIAFWVCFAVLVGTNVVYCIWADGEQQWWDDVRKLGYPADWKHGSLIPDGNPEQPETVRLSSNKTSDDVY-