Monarch geneset OGS2.0

DPOGS203611
TranscriptDPOGS203611-TA1755 bp
ProteinDPOGS203611-PA584 aa
Genomic positionDPSCF300063 + 107004-116594
RNAseq coverage106x (Rank: top 60%)
Annotation
HeliconiusHMEL0034492e-14381.27% 
BombyxBGIBMGA007278-TA2e-14484.23% 
Drosophilal(2)01810-PA2e-7247.12% 
EBI UniRef50UniRef50_UPI000179304E1e-11238.31%UPI000179304E related cluster n=5 Tax=unknown RepID=UPI000179304E
NCBI RefSeqXP_001653764.14e-12143.57%sodium-dependent phosphate transporter [Aedes aegypti]
NCBI nr blastpgi|1571210797e-12043.57%sodium-dependent phosphate transporter [Aedes aegypti]
NCBI nr blastxgi|1839792982e-16453.09%similar to CG5304-PA [Papilio xuthus]
Group
Gene OntologyGO:00550851.3e-41transmembrane transport
GO:00160211.3e-41integral to membrane
KEGG pathway 
InterPro domain[101-548] IPR0161961.8e-61Major facilitator superfamily domain, general substrate transporter
[84-308] IPR0117011.3e-41Major facilitator superfamily
Orthology group 
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203611-TA
ATGACTGCGTCTACCTACTACCGGCTATCGATATCAACTAGTGTATTTATAATACCACAGCGTTATGTCTTCTCCATAATGGCCTTGCTTGCGGTCGCTAACGCGTACACAATGCGAGTATGTCTTAATTTGGCCATAACGCAAATGGTTAAAAGGACGGTTGCTGTGGAAGGTGATCCTAATTACGATCCCGATGCGTGTCCTGATCCCAATGTTGATACAGACATCGCATCCAAGACATTGGCTAACATTACATTGACTATGTCCCATTATCTTCGTCAGGATACAGGGCCCGTACTTTTCGAATGGAGCGAAGCAACGCAAGGTCTTGTCCTCAGCGCATTTTACTATGGTTACGTGCTTACTCACATACCCGGAGGTATTATTGCTGAACGTTATGGTGGTAAATGGGTTCTCGGATTAGGTCTACTGTCCACTGCTCTCTGTACATTTATCACACCTTTTGCTGTTAAAACTGGTGGAGCAACTGCCTTGTTTATACTTCGTGTTGTTGAAGGGTTTGGAGAGGGGCCAACAATGCCAGGACTTATGGCAATGATATCTAAATGGGCACCAAAATCTGAAAGGGCTCGTATAGGAGCAATTGTTTTTGGAGGCGCACAAATTGGAAACATTGCTGGATCTTATTTCTCGGGTCTGATTATTCATGCAGGCTCTTGGGAAAATGTATTTTATATGTTTGGTGGATTTGGACTGGCTTGGTTTGCTATTTGGTCCGTACTCTGTTATAGCACACCAAATACACACCCTTTTATATCGGATAAAGAGAAAAAATTCTTGAATGAAAATGTACAAGCCCTCATCCATAGCGAGAAACAAATTTTAGATCCAGTGCCATGGAAAGCTTTACTCAGATCTGTACCTTTATGGTCTTTGATAATTGCTGGTTCTCTACTCTGTTATAGCACACCAAATACACACCCATTTATATCGGATAAAGAGAAAAAATTCTTGAATGAAAATGTACAAGCCCTCATCCATAGCGAGAAACAAATTTTAGATCCAGTACCATGGAAAGCTTTACTCAGATCTGTACCTTTATGGTCTTTGATAATTGCTGGTATTGGTCACGACTGGGGCTACTTCACAATGGTGACAGATTTGCCTAAGTACATGACAGATGTGCTTAAATTTAACATTAAGTCAGCTGGATTACTATCAGCGTTACCGTACGTCGCTATGTGGATTGCTTCGTTTTTCTTTGGTTTGCTATGTGACTTCTGCACCAAGAGAAAGTATCATAGTATTCAGAATGCAAGAAAAATTTACACTACAATTGCGGCAACTGGACCAGGTATCTGCATTATCTTAGCATCGTATTCTGGTTGTGACACCACTCTTGCTGTCTTTTGGTTTATCGCTGCTATGACCTTGATGGGTGCTTACTACAGTGGAATGAAAATAAACGCATTGGACATAACACCGAATTACGCTGGTACAACAACAGCAATGGTTAATGGAATTGCTGCTATTTCTGGCATCATTTCACCTTACCTGATAGGTTTACTCACTCCACATTCAACTTTGAAAGAATGGAGAATCGCATTCTGGGTTTGCCTTGCCATTTTGGTAATAACCAACATTATTTATCTAATATTCGCTAAAGGGGAACAGCAATGGTGGGACGATGTTAAGAGGTATGGTTATCCAGAGAATTGGAAACACGGTCCTCTGCCAGTGAGACAAAATGACACAGAAGGATCAAAGAGTGAAAAAACCAAAGAGACAAAATAA

Protein sequence:

>DPOGS203611-PA
MTASTYYRLSISTSVFIIPQRYVFSIMALLAVANAYTMRVCLNLAITQMVKRTVAVEGDPNYDPDACPDPNVDTDIASKTLANITLTMSHYLRQDTGPVLFEWSEATQGLVLSAFYYGYVLTHIPGGIIAERYGGKWVLGLGLLSTALCTFITPFAVKTGGATALFILRVVEGFGEGPTMPGLMAMISKWAPKSERARIGAIVFGGAQIGNIAGSYFSGLIIHAGSWENVFYMFGGFGLAWFAIWSVLCYSTPNTHPFISDKEKKFLNENVQALIHSEKQILDPVPWKALLRSVPLWSLIIAGSLLCYSTPNTHPFISDKEKKFLNENVQALIHSEKQILDPVPWKALLRSVPLWSLIIAGIGHDWGYFTMVTDLPKYMTDVLKFNIKSAGLLSALPYVAMWIASFFFGLLCDFCTKRKYHSIQNARKIYTTIAATGPGICIILASYSGCDTTLAVFWFIAAMTLMGAYYSGMKINALDITPNYAGTTTAMVNGIAAISGIISPYLIGLLTPHSTLKEWRIAFWVCLAILVITNIIYLIFAKGEQQWWDDVKRYGYPENWKHGPLPVRQNDTEGSKSEKTKETK-