Monarch geneset OGS2.0

DPOGS209261
TranscriptDPOGS209261-TA1896 bp
ProteinDPOGS209261-PA631 aa
Genomic positionDPSCF300111 + 374691-386268
RNAseq coverage88x (Rank: top 63%)
Annotation
HeliconiusHMEL0167340.073.27% 
BombyxBGIBMGA007041-TA0.077.57% 
DrosophilaEsp-PB9e-15145.94% 
EBI UniRef50UniRef50_E2BP270.059.32%Sodium-independent sulfate anion transporter n=22 Tax=Opisthokonta RepID=E2BP27_HARSA
NCBI RefSeqXP_002423734.10.060.55%sulfate transporter, putative [Pediculus humanus corporis]
NCBI nr blastpgi|2420057710.060.55%sulfate transporter, putative [Pediculus humanus corporis]
NCBI nr blastxgi|2420057710.061.30%sulfate transporter, putative [Pediculus humanus corporis]
Group
Gene OntologyGO:00068103.6e-55transport
GO:00550853.6e-55transmembrane transport
GO:00160213.6e-55integral to membrane
GO:00052153.6e-55transporter activity
KEGG pathway 
InterPro domain[127-417] IPR0115473.6e-55Sulphate transporter
Orthology groupMCL10140 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209261-TA
ATGTCTTCACCGTCCAAACAACGCTATAAAAAGCAAACGTGGTGCAAGCGATTGTTACATAAGAGGCTTCCAATCACAAAATGGCTGTCTGAATATAATTCCGAAAAGGCACTCGCTGACTTCATAGCTGGTGTGACAGTGGGGTTAACTGTGATACCACAAGCTTTGGCGTACGCGACCCTCGCTGGCCTTCCCCCACAGTACGGCCTCTACTCGTCGTTTATGGGATGTTTTGTATATATTTTATTTGGTTCATGCAAAGATATAACTCTGGGACCAACAGCGCTCCTGGCGCTCATGACATATGAACAAATACAGGGTAGAAACTTCGACTACGCTGTGCTTCTGTGTTTCCTAACAGGAGTTGTACAGCTGGCGATGGGAATATTGCATTTAGGTGTTCTAATAGATTTCATATCAGTCCCAGTGACGGTAGGTTTCACTTCAGCAACATCAGTTATAATAGCTGTGTCGCAACTCAAAGGACTTTTGGGACTACAATTTAAGTCGAGAGGATTTTTAGACACATTGAAAAAGGTATTTCAAAACTTACCGAATGCTAAATTAGCTGATAGTACGTTAGGTGTATCGTGCATAGTTATACTATTATTAATGAGGAAAATGAAAGATTTAAATTTGGGGCAAGAGCGAAAAGGTTTGAAGAAAGCTTTATGGCTATTATCAACTTCGCGGAACGCTATCATCGTGCTCCTTTGCTCTTTAATGGCTTTCGCATGGGAAAAGTATTCAGAATCTCCGTTCAAACTCACTGGCACCGTCAAAGAGGGCTTGCCGCTATGGTCCATGCCGCCGTTCGCTACATCATACGGGGGCACAAACGTTACCTTTATTGATATGTGCTCAGACCTCGGCTCGTCGATAATACTGGTACCCATTATAGGAGTACTTGGAAACGTCGCTATAGCTAAGGCATTTGCTAGCGGTGAATCCGTAGACGCTACGCAGGAACTTATAACCCTCTCACTATCCAATATACTCGGTTCATTTGTGAGCGCGATGCCAATAACCGGCTCGTTTTCACGAAGCGCTGTGAACCATGCTAGTGGAGTTTGTACACAATTTGGCAGTGTTTATACAGGTATTCTAGTTCTTCTAGCACTGAGTCTTTTGACGCCATATTTCTATTTTATTCCGAAAGCCTCGCTCGCCGCTGTCGTGATATGTGCTGTGGTTTTCATGATTGAGTACGAGGTCGTAAAGCCTATGTGGCGCTCACGCCGGGCGGATCTTGTGCCAGCCTTCGCTACGTTCGCTGTGTGTTTGGTTGTTGGAGTTGAAATCGGTATAGTGGCTGGGGTTCTGTTGAACGTGTTACTACTCTTGTACCCCAGCGCCAGACCGCAGATGGAAGCCGAAATTGTTACGAACTCGTCCGGATCTAACTACTTATTGATAACAGTGGGCAATAGTCTTTACTTCCCTGGCGTGGAATACATAAGACAGTACGTGAGCCGCGCCGCGAAGAAGCAGGGAGGCTGCAGCATGCCCGTTGTCATCGACTGCAGATATGTACTTGGTGCAGACTTTACTGCCGCTAAGGGTATCTGTGCGTTGTCAAATTCGTTAGCATCACGCGGCCAGCCGCTAGTACTGTTAGCGCCAAGACAATGTGTCGCAGACGTCTTCATAGGCGCCGGTTCGAGTGTTGTCGTGGTGATGACGGCCAATGAATTGGACGATACATTACAAGATTTAACAAATCAAATAGCCCTGAAGGACATCAATGGGAAAGAGAAAAAAATAACCCCTCCGCCTTCATACAACACCCTCACACAGATAGGCGACGAAGTTGAAGTCATTATATCAGACAGTAATAACCTTCCATTACTGAGCAATCGTCATAAAACAGTGTCCGATGAGGTCAATGACACGTGA

Protein sequence:

>DPOGS209261-PA
MSSPSKQRYKKQTWCKRLLHKRLPITKWLSEYNSEKALADFIAGVTVGLTVIPQALAYATLAGLPPQYGLYSSFMGCFVYILFGSCKDITLGPTALLALMTYEQIQGRNFDYAVLLCFLTGVVQLAMGILHLGVLIDFISVPVTVGFTSATSVIIAVSQLKGLLGLQFKSRGFLDTLKKVFQNLPNAKLADSTLGVSCIVILLLMRKMKDLNLGQERKGLKKALWLLSTSRNAIIVLLCSLMAFAWEKYSESPFKLTGTVKEGLPLWSMPPFATSYGGTNVTFIDMCSDLGSSIILVPIIGVLGNVAIAKAFASGESVDATQELITLSLSNILGSFVSAMPITGSFSRSAVNHASGVCTQFGSVYTGILVLLALSLLTPYFYFIPKASLAAVVICAVVFMIEYEVVKPMWRSRRADLVPAFATFAVCLVVGVEIGIVAGVLLNVLLLLYPSARPQMEAEIVTNSSGSNYLLITVGNSLYFPGVEYIRQYVSRAAKKQGGCSMPVVIDCRYVLGADFTAAKGICALSNSLASRGQPLVLLAPRQCVADVFIGAGSSVVVVMTANELDDTLQDLTNQIALKDINGKEKKITPPPSYNTLTQIGDEVEVIISDSNNLPLLSNRHKTVSDEVNDT-