Monarch geneset OGS2.0

DPOGS201153
TranscriptDPOGS201153-TA1179 bp
ProteinDPOGS201153-PA392 aa
Genomic positionDPSCF300065 - 842-11260
RNAseq coverage42x (Rank: top 72%)
Annotation
HeliconiusHMEL0023602e-13171.62% 
BombyxBGIBMGA002910-TA2e-9953.87% 
DrosophilaCG5002-PA1e-8344.62% 
EBI UniRef50UniRef50_E1ZW292e-9150.82%Sodium-independent sulfate anion transporter n=7 Tax=Camponotus floridanus RepID=E1ZW29_CAMFO
NCBI RefSeqXP_001601834.13e-9550.67%PREDICTED: similar to sulfate transporter [Nasonia vitripennis]
NCBI nr blastpgi|3454831121e-9450.67%PREDICTED: sodium-independent sulfate anion transporter-like [Nasonia vitripennis]
NCBI nr blastxgi|3071967512e-9751.90%Sodium-independent sulfate anion transporter [Harpegnathos saltator]
Group
Gene OntologyGO:00068101.6e-52transport
GO:00550851.6e-52transmembrane transport
GO:00160211.6e-52integral to membrane
GO:00052151.6e-52transporter activity
KEGG pathway 
InterPro domain[94-378] IPR0115471.6e-52Sulphate transporter
Orthology groupMCL23298 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201153-TA
ATGGATTTCGACGATCCGAGGCAGCCGCTATTGGGTATAACAGTGGGTCTAACGTCAATACCGCAAGGGATAGCGTACGCCATTGTAGCAGGACTTCCGCCACAGGTGGGCCTGTACTCCAGTATATTCCCTGGCGTCATGTACGCTATATTCGGCAGCTGTAAGCAGGTGACTGTTGGCCCGACGGCCATATTGGCAGCGTTATTGACCAAGTACGTAGCACAATCAGAAGATTTTGCGTACTTGGCATCCTTTTTGACTGGCTGTGTTATATTACTACTTGGTGTTTTGCAATTAGGTTTCCTTTTAGATTTCATATCAAAGCCAGTTATAAGCGGGTTCACTGCGGCCGCCGCCTTGCAGATATCAGCTTCACAATTAAAATCGTTGTTCAACACGACCGGTAGTTCTGGTGGGACGTTTATAAAGGCTGTTATAAATTTCTTCTCAAATATAAAATCTGTTCAGCTGTGGGACACCTTACTAGGCGTCCTCACCATCGTATCTCTGTTTCTTCTTAAATGCTGCTCCCCCTCCTCCCCCCTCTCGTGCTGTGCCACGTGTCGCGTGCACTCGGTCCGCGCTCGTAACGCCGTAGTCGTGTTTGCGGCTACGGCCGTGGCGTACTTGTTCTACATCTACGGCATGACGCCGTTCAAACTAACCGGTAAAATAGAGGGTGGTTTGCCTAAATTCGGTCTACCTCCATTCCAGACTGTAGTAAATAATAATACTATTGGTTTTGATAAAATGTTGGATGTCTTAGGCCCGGAAGGTCTCGTGATGCCGCTAGTAGCGATACTGGAGAGTATCGCTATTGCTAAAGCCTTCGCCGGCACAGCATCAGTGGACGTCACGCAGGAGATGATAGCTGTGGGTATGTGTAACATAGTGTCGTCGTTCGCGCAGAGCATGCCGGCCACGGGCTCCTTCACACGGACCGCCCTCAACCACGCCAGCGGGGTCATGACGCCAGCCGGCTCCCTGTTTAAAGCGGCGTTAGTGTTACTGTCAGTGACGTACTTGTCCGAGGCGTTCCGCTTCATCCCTCGCTCGACCCTGGCCGGCATCATCATGGTGGCGATGGTGTCCATCGTCGACTTCTCTATTCTACCGCCACTATGGAGACACAGCAGTAAGTGGGCTCGTCCGAGGTCTGTATGGAGACGGGTAGTGTGA

Protein sequence:

>DPOGS201153-PA
MDFDDPRQPLLGITVGLTSIPQGIAYAIVAGLPPQVGLYSSIFPGVMYAIFGSCKQVTVGPTAILAALLTKYVAQSEDFAYLASFLTGCVILLLGVLQLGFLLDFISKPVISGFTAAAALQISASQLKSLFNTTGSSGGTFIKAVINFFSNIKSVQLWDTLLGVLTIVSLFLLKCCSPSSPLSCCATCRVHSVRARNAVVVFAATAVAYLFYIYGMTPFKLTGKIEGGLPKFGLPPFQTVVNNNTIGFDKMLDVLGPEGLVMPLVAILESIAIAKAFAGTASVDVTQEMIAVGMCNIVSSFAQSMPATGSFTRTALNHASGVMTPAGSLFKAALVLLSVTYLSEAFRFIPRSTLAGIIMVAMVSIVDFSILPPLWRHSSKWARPRSVWRRVV-