Monarch geneset OGS2.0

DPOGS203255
TranscriptDPOGS203255-TA2148 bp
ProteinDPOGS203255-PA715 aa
Genomic positionDPSCF300210 + 267510-288744
RNAseq coverage127x (Rank: top 57%)
Annotation
HeliconiusHMEL0207902e-16478.02% 
BombyxBGIBMGA007038-TA1e-12375.93% 
DrosophilaEsp-PB2e-8743.48% 
EBI UniRef50UniRef50_E0VH851e-10549.45%High affinity sulfate transporter, putative n=1 Tax=Pediculus humanus corporis RepID=E0VH85_PEDHC
NCBI RefSeqXP_002425479.12e-10649.45%High affinity sulfate transporter, putative [Pediculus humanus corporis]
NCBI nr blastpgi|3838656571e-10550.55%PREDICTED: sodium-independent sulfate anion transporter-like [Megachile rotundata]
NCBI nr blastxgi|3504203165e-10752.20%PREDICTED: sodium-independent sulfate anion transporter-like isoform 1 [Bombus impatiens]
Group
Gene OntologyGO:00068101.9e-40transport
GO:00550851.9e-40transmembrane transport
GO:00160211.9e-40integral to membrane
GO:00052151.9e-40transporter activity
KEGG pathway 
InterPro domain[119-364] IPR0115471.9e-40Sulphate transporter
Orthology groupMCL25308 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203255-TA
ATGATGAAAATAGATCTCCGGAGACTGGTAGGGCGGGTTTTCCCCATAGTTCAATGGTCGAGGTTATATGACGTGAACACCGCTGTAGGGGACCTCATAGCGGGGATTACCATCGCCCTAACTCTCATACCGCAGTCTATTGCATATGCTTCGTTAGCTGGATTCGAACCTCAGTATGGCCTGTACGCGTCATTTGCTGGTGGGTTCGTATACGCGTTACTGGGCACCTGTCCACAGATCAATATAGGGCCAACAGCCCTACTTTCCCTTCTCACATTCACTTACACAAACGGGACGAACCCTGACTTCGCTATTCTTCTCTGCTTCATCGGTGGCATCATTCAATTGATAGCTGGTGTAATCCAATTAGGTTTTCTTGTGGAATTTGTATCACTACCAGTCGTTTCTGGGTTCACATCAGCTGCAGCCATAACTATAGCGTCTTCCCAAATAAAAGGTCTTTTAGGTTTGAAATTTAAAGCAGAGAATTTCATTTCAACGTGGCGAGGAGTTTTACATCATATTGGTGAAACGAAGCTAGAAGACTCGCTGCTTGGACTCTCTTGCTGTATTGTACTTATGGGAATGAAGGCTCTTAAAGATGTTCGTCTCAAAGACAACGACGAGAAAAGTCGTCGGTCACAGATACTACAGCGATGTTTTTGGTTCGTGGGTGTGGCGAGGAACGCCGTGGTAGTGGTTACAGCTTCCATTATAGCGTTCTTCGTCCACCAGGACAAAGAGGTTCCTCTTATACTCACAGGTGATATAACTCCAGGTCTACCAATTCCACAGCTGCCACCTTTTAAATCGATGGAAGGTAATTCGACAATTACTACTGGTGAGATGTTATCTCACTTGGGTTCAGGACTGATAGTTGTACCCCTAGTCGGAGTGATATCCAATGTTGCCATCGCCAAAGCCTTTTCTAAAGGTAAGACATTGGACGCTACGCAAGAGATAGTGTCACTCGGGGCTTGCAACATTATAGGCTCTTTCTTCCGCTCATTTCCCGTGAACGGTTCGTTTACGAGGAGTGCTGTAAGTGATGCGTCAGGGGTTAGAACCCCCGCGGCAGGATTTTATACTGGTGATATAACTCCAGGTCTACCAATTCCACAGCTGCCACCTTTTAAATCGATGGAAGGTAATTCGACAATTACTACTGGTGAGATGTTATCTCACTTGGGTTCAGGACTGATAGTTGTACCCCTAGTCGGAGTGATATCCAATGTTGCCATCGCCAAAGCCTTTTCTAAAGGTAAGACATTGGACGCTACGCAAGAGATAGTGTCACTCGGGGCTTGCAACATTATAGGCTCTTTCTTCCGCTCATTTCCCGTGAACGGTTCGTTTACGAGGAGTGCTGTAAGTGATGCGTCAGGGGTTAGAACCCCCGCGGCAGGATTTTATACTGGTATAATCGTATTACTAACTCTGGGTGTGCTGACCCCCTACTTCTATTTCATACCTCGCTCCGCCCTCTCAGCGGTCATCGTGTGCGCTGTTCTTTATATGGTTGATATTAGTGTTATTGGAACCCTATGGAGGACAAACAGACTTGATTTGATACCACTATTCGGTACATTCCTAAGCTGTCTAGTGTTTGGTGTTGAGTTGGGTTTGGGCTGTGGAGTCGTGATTGACGTTCTGCTTCTATTGTACTACAATTCAAGACCGCAGTTAAATATTAAATTTGTCAATGACGACAATCTTCCACCCCATTACTCCGTAGAACCAGTAGGTAGTTTGAATTTCGCTAGTGCTGAGAAGGTTCGTTTAACATTGACCGCTTTAAAGAAATCGAACGAACTGACTGATATCCGGCTAGATAATAATTTACGAGTTATAAACGATGGGGTCAGCAATACAAGTGAATCAAGACCGCGGGCTGGTAATGTATTGGTGGTACATTGTAATTCACTGGTCAGACTGGACTACACATTTTTACAGAGCCTCAGTATGCTGGTAAGCGAATGGTCTCTCCATGGTCACATAGTATGGTGTGACGCCAGCCCGCGTATACAAGAACAGCTTAATAGCGTGTTACATGACGTCAAATTCTGTGATATTCAGTCACTATCTGTGGTACTGTTAGATTTAACAATGGCAGCACAAACAGGTCATTCCAGTGATACAAGACTTTAA

Protein sequence:

>DPOGS203255-PA
MMKIDLRRLVGRVFPIVQWSRLYDVNTAVGDLIAGITIALTLIPQSIAYASLAGFEPQYGLYASFAGGFVYALLGTCPQINIGPTALLSLLTFTYTNGTNPDFAILLCFIGGIIQLIAGVIQLGFLVEFVSLPVVSGFTSAAAITIASSQIKGLLGLKFKAENFISTWRGVLHHIGETKLEDSLLGLSCCIVLMGMKALKDVRLKDNDEKSRRSQILQRCFWFVGVARNAVVVVTASIIAFFVHQDKEVPLILTGDITPGLPIPQLPPFKSMEGNSTITTGEMLSHLGSGLIVVPLVGVISNVAIAKAFSKGKTLDATQEIVSLGACNIIGSFFRSFPVNGSFTRSAVSDASGVRTPAAGFYTGDITPGLPIPQLPPFKSMEGNSTITTGEMLSHLGSGLIVVPLVGVISNVAIAKAFSKGKTLDATQEIVSLGACNIIGSFFRSFPVNGSFTRSAVSDASGVRTPAAGFYTGIIVLLTLGVLTPYFYFIPRSALSAVIVCAVLYMVDISVIGTLWRTNRLDLIPLFGTFLSCLVFGVELGLGCGVVIDVLLLLYYNSRPQLNIKFVNDDNLPPHYSVEPVGSLNFASAEKVRLTLTALKKSNELTDIRLDNNLRVINDGVSNTSESRPRAGNVLVVHCNSLVRLDYTFLQSLSMLVSEWSLHGHIVWCDASPRIQEQLNSVLHDVKFCDIQSLSVVLLDLTMAAQTGHSSDTRL-