Monarch geneset OGS2.0

DPOGS202076
TranscriptDPOGS202076-TA1593 bp
ProteinDPOGS202076-PA530 aa
Genomic positionDPSCF300116 - 489839-500885
RNAseq coverage227x (Rank: top 44%)
Annotation
HeliconiusHMEL0023602e-14058.17% 
BombyxBGIBMGA002910-TA1e-10045.24% 
DrosophilaCG5002-PA2e-9138.70% 
EBI UniRef50UniRef50_E1ZW298e-10042.07%Sodium-independent sulfate anion transporter n=7 Tax=Camponotus floridanus RepID=E1ZW29_CAMFO
NCBI RefSeqXP_001602717.14e-10442.52%PREDICTED: similar to sulfate transporter, partial [Nasonia vitripennis]
NCBI nr blastpgi|3454879805e-10442.55%PREDICTED: sodium-independent sulfate anion transporter-like [Nasonia vitripennis]
NCBI nr blastxgi|3071967512e-10343.68%Sodium-independent sulfate anion transporter [Harpegnathos saltator]
Group
Gene OntologyGO:00068106e-34transport
GO:00550856e-34transmembrane transport
GO:00160216e-34integral to membrane
GO:00052156e-34transporter activity
KEGG pathway 
InterPro domain[242-371] IPR0115476e-34Sulphate transporter
Orthology groupMCL23298 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202076-TA
ATGTGTAACGTGGAATCATGGAGACGAAGGGTTCCTATCACCATTTGGCTGCCTCAGTACAATCTGGAGAAGTTGCTGCGTGATGCCATCGCTGGTATAACGGTGGGTCTAACGTCAATACCGCAAGGGATAGCGTACGCACTTGTAGCAGGACTTCCGCCACAGGTGGGCCTGTACTCCAGTATATTCCCCGGCGTCATGTACGCTATATTCGGCAGCTGTAAGCAGGTGACTGTTGGCCCGACGGCCATATTGGCAGCGTTATTGACCAAGTACGTAGCACAATCAGAAGATTTTGCGTACTTGGCATCCTTTTTGACTGGCTGTGTTATATTACTACTTGGTGTTTTGCAATTAGGTTTCCTTTTAGATTTCATATCAAAGCCAGTTATAAGCGGGTTCACTGCGGCCGCCGCCTTGCAGATATCAGCTTCACAATTAAAATCGTTGTTCAACACGACCGGTAGTTCCGGTGGGACGTTTATAAAGGCTGTTATAAATTTCTTCTCAAATATAAAATCAGTTCAGCTGTGGGACACCTTACTAGGCGTCCTCACCATCGTATCTCTGTTTCTTCTTAAATGCTGCTCCCCCTCCTCCCCCCTCTCGTGTTGTGCCACGTGTCGCGTGCACTCGGTCCGCGCTCGTAACGCCGTAGTGGTGTTCGCGGCTACGGCCGTGGCGTACTTGTTCTACATCTACGGCATGACGCCGTTCAAACTAACCGGCCCGGAAGGTCTCGTGATGCCGCTAGTAGCTATACTGGAGAGTATCGCTATTGCTAAAGCCTTCGCCGGCACAGCATCAGTGGACGTCACGCAGGAGATGATAGCTGTGGGTATGTGTAACATAGTGTCGTCGTTCGCGCAGAGCATGCCGGCCACGGGCTCCTTCACACGGACCGCCCTCAACCACGCCAGCGGGGTCATGACGCCAGCCGGCTCCCTGTTTAAAGCGGCGTTAGTGTTACTGTCAGTGACGTACTTGTCCGAGGCGTTCCGCTTCATCCCTCGCTCGACCCTGGCCGGCATCATCATGGTGGCGATGGTGTCCATCGTCGACTTCTCTATTCTACCGCCACTATGGAGACACAGCAAGTCGGAACTGTTCGTCTGGTTTCTGACCGTCGTGGTGGGTGTGACAGCCGGTCTGGAGTACGGGATAGCGGCCGGCGCGGCGGGTGACGCTCTAAGAGTGCTGTACTCCGCCTCCCGACCGCGGCTGGTGACTAAACAGTTCAAGGTGAGGTATCAGGTAGGCGCGGTGGATGTGGCGCTGGTGGCTCTACCGGAAGTTGTTTGTTACGCGAGTGTGGAGCACGTGGGGAGAGTGCTGAGGACCGTCCGCGGGCACGTTGTTGTATTGGACGGAAGGACCAACATACACGCGGATGTAGGAGTTATAGAGACCGTGAACTCTGTGATCTCTGACGCCGAGGGTGGAAAGCGTCGTTTTCTTCTGTGGTGTGTTGAGGAAGGTCACGGCTTCGACGGAATCAACGCGACCCTCGTCACCGGAGATGACCTCCAACGAATAGTCACAGAACACTGTATTGATAACAGAAGAGTATCTTCACTAAGCGGTGTGAGCTGA

Protein sequence:

>DPOGS202076-PA
MCNVESWRRRVPITIWLPQYNLEKLLRDAIAGITVGLTSIPQGIAYALVAGLPPQVGLYSSIFPGVMYAIFGSCKQVTVGPTAILAALLTKYVAQSEDFAYLASFLTGCVILLLGVLQLGFLLDFISKPVISGFTAAAALQISASQLKSLFNTTGSSGGTFIKAVINFFSNIKSVQLWDTLLGVLTIVSLFLLKCCSPSSPLSCCATCRVHSVRARNAVVVFAATAVAYLFYIYGMTPFKLTGPEGLVMPLVAILESIAIAKAFAGTASVDVTQEMIAVGMCNIVSSFAQSMPATGSFTRTALNHASGVMTPAGSLFKAALVLLSVTYLSEAFRFIPRSTLAGIIMVAMVSIVDFSILPPLWRHSKSELFVWFLTVVVGVTAGLEYGIAAGAAGDALRVLYSASRPRLVTKQFKVRYQVGAVDVALVALPEVVCYASVEHVGRVLRTVRGHVVVLDGRTNIHADVGVIETVNSVISDAEGGKRRFLLWCVEEGHGFDGINATLVTGDDLQRIVTEHCIDNRRVSSLSGVS-