Monarch geneset OGS2.0

DPOGS200158
TranscriptDPOGS200158-TA1500 bp
ProteinDPOGS200158-PA499 aa
Genomic positionDPSCF300128 + 97891-99390
RNAseq coverage4x (Rank: top 88%)
Annotation
HeliconiusHMEL0201000.068.24% 
BombyxBGIBMGA002910-TA2e-14454.10% 
DrosophilaCG5002-PA2e-9839.80% 
EBI UniRef50UniRef50_E2AIV74e-11347.16%Sodium-independent sulfate anion transporter n=5 Tax=Formicidae RepID=E2AIV7_CAMFO
NCBI RefSeqXP_315178.41e-11143.11%AGAP004636-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|3071775251e-11247.16%Sodium-independent sulfate anion transporter [Camponotus floridanus]
NCBI nr blastxgi|3071967511e-11648.09%Sodium-independent sulfate anion transporter [Harpegnathos saltator]
Group
Gene OntologyGO:00068103.5e-60transport
GO:00550853.5e-60transmembrane transport
GO:00160213.5e-60integral to membrane
GO:00052153.5e-60transporter activity
KEGG pathway 
InterPro domain[49-342] IPR0115473.5e-60Sulphate transporter
Orthology groupMCL10140 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200158-TA
ATGTATCTTTTTGTGGGAAGCTGCAAAGACATCACAATTGGACCTACAGCAATCATGTCGGCTGTTGTAGCAAAGTACGTAGCGAATTACTCGTCTGAATTTGCTGTATTAGCAGCGTTCCTGACCGGCGTAGTTATCATAATAATGGGTATGCTGAACCTTGGATTCTTAGTAGAATTCATATCTATACCAGTCATCAGTGGTTTTACATCCGCAGCAGCATTGCAAATAGCATCGGCTCAGTTAAAATCACTTTTTGGTCTCGATGGGTCCGCTGGCCATTATTTTTCTGAGTCTATAATTAATTTTTTTAAGAACATCACGACTTTTGTTTACTGGGATTCGTCCCTGGGTTTAATCACTATATTAATATTGGTATTATTGAAACGCCTTGGAGAAGGATGTAATCGCACAGATCCCTTGGCTAAGCAGATCAGATGGTTCATATCTCTTGCAAAGAATGCAGTTGTTGTCATATTCGGTATGGCTGTGGCGTATATTATTAAGGTGTCTACAGGTACCGAACCGATTAAGTTGATTGGAGAAATCGGCAGTGGGTTCCCTAAGATAGAGCCACCGCCATTCAGCGCTGTTGTTGGCAATCAGACCTATACGTTTTCCGATATGATGAAAGTTTTAGGTCCTGAATCTCTCATTCTGCCAATGGTTTCTATATTAGAGCTGGTTGCAATCGCTAAAGCCTTCGCAGCTGGGGGTCAGATCGATGCTACCCAAGAAATGATTGCCCTTGGATTATGCAATATGGTCGGCTCCTTTGTGAAGAGTATGCCTGTGTCTGGATCTTTTACTCGCACAGCTCTCAATAATGCTTCAGGTGTCCAAACACCACTGGGTGGAATTTTCACTGCAACACTTTTAATTCTGGCTTTAAGTTTACTCACTAAAACGTTTTACTACATTCCTAAACCGTCTTTAGCCGGTTTAATCATAACCGCTATGTTTTATATGATAGATTTCAAAATAGTAATTCGATTGTGGAAAACCAGTAAAAAAGAATTTTTTGTATATATAGCAACATGGCTAGCTAGCTTATTGTATGGACTGGAATATGGCATACTGACCGGCATATTGGCTGATGCTTTGATACTCCTGTTTGCAACAGCAAGACCTGCATGTGAAATGACGACTATTGCTGGAGATAAGAAGACGATGGTAGTGATATCGTTGCCAGAAAATTTGTCGTACTGTGCAGCTGAACATGTCCGAAGGAAAATACTAAAAGCCACGCTCGAGAGCCACCGCGTTACAAGCCTACAGATTGTAGTAAATGGAACAAATCTAAGAATCATGGATTCAACGGTTGCTACGAATTTGATGTCTATAATCAAAGATTTGGAAAAAGACTTTAACATAGTATTTCTGAATTTCAACAGCAATCTACAAAAGATGTGCAATAACGTTAATGATAAATACAGTCATATATTTGTGACAGATGTCAATGTGGATGGCATCTTTAAAGTGAAACAAAGCCCTGCCTAA

Protein sequence:

>DPOGS200158-PA
MYLFVGSCKDITIGPTAIMSAVVAKYVANYSSEFAVLAAFLTGVVIIIMGMLNLGFLVEFISIPVISGFTSAAALQIASAQLKSLFGLDGSAGHYFSESIINFFKNITTFVYWDSSLGLITILILVLLKRLGEGCNRTDPLAKQIRWFISLAKNAVVVIFGMAVAYIIKVSTGTEPIKLIGEIGSGFPKIEPPPFSAVVGNQTYTFSDMMKVLGPESLILPMVSILELVAIAKAFAAGGQIDATQEMIALGLCNMVGSFVKSMPVSGSFTRTALNNASGVQTPLGGIFTATLLILALSLLTKTFYYIPKPSLAGLIITAMFYMIDFKIVIRLWKTSKKEFFVYIATWLASLLYGLEYGILTGILADALILLFATARPACEMTTIAGDKKTMVVISLPENLSYCAAEHVRRKILKATLESHRVTSLQIVVNGTNLRIMDSTVATNLMSIIKDLEKDFNIVFLNFNSNLQKMCNNVNDKYSHIFVTDVNVDGIFKVKQSPA-