Monarch geneset OGS2.0

DPOGS203617
TranscriptDPOGS203617-TA1803 bp
ProteinDPOGS203617-PA600 aa
Genomic positionDPSCF300063 + 216502-230841
RNAseq coverage44x (Rank: top 71%)
Annotation
HeliconiusHMEL0173212e-8950.39% 
BombyxBGIBMGA012800-TA7e-6632.63% 
DrosophilaIndy-PB2e-8834.36% 
EBI UniRef50UniRef50_E2AWA15e-8935.00%Protein I'm not dead yet n=5 Tax=Camponotus floridanus RepID=E2AWA1_CAMFO
NCBI RefSeqXP_001985665.12e-9134.57%GH17192 [Drosophila grimshawi]
NCBI nr blastpgi|1950229344e-9034.57%GH17192 [Drosophila grimshawi]
NCBI nr blastxgi|1571244314e-9232.89%sodium/dicarboxylate cotransporter, putative [Aedes aegypti]
Group
Gene OntologyGO:00160205.8e-42membrane
GO:00550855.8e-42transmembrane transport
GO:00068145.8e-42sodium ion transport
GO:00052155.8e-42transporter activity
KEGG pathway 
InterPro domain[48-425] IPR0018985.8e-42Sodium/sulphate symporter
Orthology groupMCL25330 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203617-TA
ATGGCTGAATATGCAAGTGATCCTGACAATATATCTGAAAATATTGGACGAAATGAAGAGGAAGATCAGTTTTCCCCGGACGACGAACTGCTGACAATAAAAAAGAAATTTATTTATTATTTCTGCTTTCATTGGCGGGGATTAATATGTACATTTTTGCCCATTATTTTAATACCTATACAGTTATCGTTTCCGCCTAAGGGATATGAGATGTGTGCTTACACTCTTGAGTTGATGGCCATATTTTGGGTAACGGAATGTATTCCTCTAGCGGTCACATCTTTTCTGCCAGTAGTGATTTTCCCTCTTACCGGAATAATGAGTACAGCAGAAACTTGTGCTTGTTACATGAATGATTCTATTATGATGTTCCTAGCTAGCATGTGGCTTGCATATGCTATAGAACAATCAGGATTACATATAAGATTAGCGCATCATGTCATTCGTTGTGTGGGCTATTCTCATTACAAGTTATTGTTTTCTATGTCCTTTACTACAATGTTTGTCTCGATATGGATAACAAATACCGCTGCTACAACAATGATGGTGCCCATAAATTTTGCAATACTGAAAGTTTTTGAGGATCAAAACTTGCTTCAAATTTACGACGAAGGATCTCATGGGGAGAAAGTGGCGTCAGACATTGCAACTTGCTATTTCTGTGCCGCTACTTTATCAGCAAGTATAGGTGGAATTGGATCTTTAGTGGGAACCGCTACTAACTTGGTTTTTAAAGAATTATTTACAAGACTTTACCCAGATGCTCCAGAATATTTGTCATTTCCAAAATTTTCCGCTTTTACAATACCTTTTATGGTGATAATGGAATTATTTTGCTACTTATATCTTATCATCGTCTATTTTGGATATTTGAGGCCAAAAAGTAAAGCCGCGAGGAATGCTGAAATAACAGAATCGGGAATGGAAGCAGCGAAACAAGCTGTTGAAGAAAAAATTAAAGAAATAGGTTCAGTAACTACGTGGGAAATTTTTGTCTTGATACTCTTTACCGGAGCAATACTGTTATTTTTTTCCCGTTCACCTCAAATCTTTGTTGGTTGGGGAGATGCTATTTCCAATTATTTTGGAATTGAAGATGTCAAATTTGTCCGAGATTCAGCAGCTGCTATTTTCGTAGTGTTTTTGATGCTGCTCATTCCGTCATCACTAAAATTTTATGACAATTTTAGGGCAAAAGCTTCTAAATGCCCGATATTTGGTGAATATTGTGAGTTAAAGAGAAACGTTTTACCAACTTACGATGATGTTGTCAAACATTACGAATGGACGCGGCATATTATGAAAACAAGCAAAAAAGAGCCCAGATTTTCAGAAATTGCTGATAAAATCGTACCAGAAATCGAAAGTGGTGGTTTCGCTCTTTCTAAAGCTGCAAAAAGTGACTATACCGATCTCAATGGAAAAATAGGAAATATTCTCAAGAACTTTAAATATCTGCCGAATCCATTAGTACTTTTGTTAATAATTATCGTAACTGTATTCATAACAAACTTTGCTTCAAATGTTGCAGTTTGTAATGTTATAGTTCCGATTGCAATGCAGTTGGCGAGGGAAATTAATAGAAGTCCGCTTTGGTACAACCTTGCCGCTGGTTTATCTTCATCTTATGCATTTTGCGTACCCGTTGGGACGCCAGGGAATCTCATAGTTCAAGGAGCGGCCAATATACCAAGTTCAAAAATGATAAAGGCCGGTATTGGAGTTACGTTTTCCAGTATCTTTGTAACATGGTTAGCCGTTTGTTTTTGGGCTCCAATTATATGGCCTGACTTATCTACATGA

Protein sequence:

>DPOGS203617-PA
MAEYASDPDNISENIGRNEEEDQFSPDDELLTIKKKFIYYFCFHWRGLICTFLPIILIPIQLSFPPKGYEMCAYTLELMAIFWVTECIPLAVTSFLPVVIFPLTGIMSTAETCACYMNDSIMMFLASMWLAYAIEQSGLHIRLAHHVIRCVGYSHYKLLFSMSFTTMFVSIWITNTAATTMMVPINFAILKVFEDQNLLQIYDEGSHGEKVASDIATCYFCAATLSASIGGIGSLVGTATNLVFKELFTRLYPDAPEYLSFPKFSAFTIPFMVIMELFCYLYLIIVYFGYLRPKSKAARNAEITESGMEAAKQAVEEKIKEIGSVTTWEIFVLILFTGAILLFFSRSPQIFVGWGDAISNYFGIEDVKFVRDSAAAIFVVFLMLLIPSSLKFYDNFRAKASKCPIFGEYCELKRNVLPTYDDVVKHYEWTRHIMKTSKKEPRFSEIADKIVPEIESGGFALSKAAKSDYTDLNGKIGNILKNFKYLPNPLVLLLIIIVTVFITNFASNVAVCNVIVPIAMQLAREINRSPLWYNLAAGLSSSYAFCVPVGTPGNLIVQGAANIPSSKMIKAGIGVTFSSIFVTWLAVCFWAPIIWPDLST-