Monarch geneset OGS2.0

DPOGS203088
TranscriptDPOGS203088-TA1743 bp
ProteinDPOGS203088-PA580 aa
Genomic positionDPSCF300228 - 29659-35965
RNAseq coverage562x (Rank: top 23%)
Annotation
HeliconiusHMEL0023493e-16976.12% 
BombyxBGIBMGA002324-TA0.082.17% 
DrosophilaCG7720-PC3e-14948.53% 
EBI UniRef50UniRef50_Q7K3Q24e-14748.53%CG7720, isoform A n=33 Tax=Endopterygota RepID=Q7K3Q2_DROME
NCBI RefSeqXP_001844105.19e-16752.58%sodium/solute symporter [Culex quinquefasciatus]
NCBI nr blastpgi|1700324722e-16552.58%sodium/solute symporter [Culex quinquefasciatus]
NCBI nr blastxgi|1571677134e-16453.78%sodium/solute symporter [Aedes aegypti]
Group
Gene OntologyGO:00160205.9e-150membrane
GO:00068105.9e-150transport
GO:00550855.9e-150transmembrane transport
GO:00052155.9e-150transporter activity
KEGG pathway 
InterPro domain[3-550] IPR0017345.9e-150Sodium/solute symporter
[8-395] IPR0199001.8e-56Sodium/solute symporter, subgroup
Orthology groupMCL11163 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203088-TA
ATGATATTGTCCGTATCAAGAGGAACATTAGGAGTTCGCTCGTTTTTGGGTTTTCCTTCTGAATTAGTGTACTTTGGTTCAGCAATGTGGGAAACTCTTTACGGCATGGTCTTAGCTTTTCCTCTCGTTTGCTGGATCTTCATACCGGTATACTACAGACTCAGTACGAATTCTGTCTATGAATATTTGCAGATGCGTTTCGGTTCAAGATGGGTTCGTCGTCTCGCCGCGGCTACATTCCTACTCCGTCAGACTTTGAACCTTGCTATCACAGTCTACACTCCGAGTGTTGCTCTACACGCTGTCCTTGCTTTACCACATTGGGCATCAGCAGCTGCTCTAACTATTGTTGCGATAGTATTTAATTTGTTGGGCGGATTAGCTGCAGCTATCAGAGCAGACGTGATCCAGACCCTTACAATGGTGCTTGTCAGCGGTGCCTTTATTATCCAGGCGACAGTGCAAGCTGGGGGACCAGCCCAGGTCATATCAGACAATGTCGAAGGAAGACGACTGGAATTTCTGAAGTTTACATGGGACCCAACAGTCCGAGTGGATACTCTTTCTGCATTAGTTGGTCAAATGTTCATGTCCGTATCAATATACGGCTGTCAGCAGACCTTCGTCCAACGTTACTGCTCTATGTCCAGTGAAAAAAGAGTCAAGAGGACACTTTTAGCTAATGTGCCAGCAGTATTGATACTCTTCAGTCTGTCATGGGTAGTCGGAATGGCATTGTATGCGGTTTATAAATATTGTGATCCGTATATGTCGGGAAAAATAGTAGCTAAAGATGAAGTTCTGCCATTTTATGTGCAAGATCAATTTACTTTTTTGCCGGGAATGTTGGGACTGTTTTTGGGGAGCATCTTTAATGGAGCTTTAAGTTTCCTGGTATCTAATATGAATTCTCTGTCAACGGTAACTTGGGAAGACTTTGTATCAGAGGCACCGGCTTTTAAAGGCATCAGTAACAAACAACAATTGACAGTCATTAAAATAATTGGAATTATATATGCTTTAACAATAATGTCATTATCACTCTGCGTGGGTTTGGTCGGAGGTGTAATAGAGGGATCACTCTTAGTAACTTCCGCTACCTCTGGAGCTTTACTCGGAGTATTTCTATTAGCAGCTATATTTCCTGTTGCAAACGGAAAGGGAGCTTTGTGTGGTATGATTGGATCCCACTTTATCACATCATGGATGGCCATTGGAAGAATTCTATACGTCAATGATAAAAAATCTATGCTACCGTTATCTGTTCAGAAATGTCCCAACAGCACATTAAGCGTGTTGGCTCATAAACCACTCTTAGTTAGTCAAAATTCAACAATCGCGCAAATAGTGAATGTTTTAGACAAACACGTTTATAATAAAGGAAAGGATCAATCGCCCATTTTCGCTGCTCTTATGCGGATGTATTCAGTGTCATACATGTGGTACGCTGTCATAGGTACTGTTACTTGCATAATATTAGGAGTTACCATTGGTTTGCTAACGGCCAGTGAAAGCGACGCCTTTGACGAGAAACTCCTTCATCCATTGGTGGCAAAAATTACTAGAAAAATGCCAGGCAAAAAGAGAACCTTCACTAATATTATTAAAGAGAAAGTAGTTGAGGTGCAGGAAGTTGAGAAAATAGAAGAAACGACGCTGGAAGTGAAATCAGTGTTATCTTCTAATGGCAGTAGATTATTCGATGCTTATGATAATAACTTGCAATCAAGAACCAGACTTTGA

Protein sequence:

>DPOGS203088-PA
MILSVSRGTLGVRSFLGFPSELVYFGSAMWETLYGMVLAFPLVCWIFIPVYYRLSTNSVYEYLQMRFGSRWVRRLAAATFLLRQTLNLAITVYTPSVALHAVLALPHWASAAALTIVAIVFNLLGGLAAAIRADVIQTLTMVLVSGAFIIQATVQAGGPAQVISDNVEGRRLEFLKFTWDPTVRVDTLSALVGQMFMSVSIYGCQQTFVQRYCSMSSEKRVKRTLLANVPAVLILFSLSWVVGMALYAVYKYCDPYMSGKIVAKDEVLPFYVQDQFTFLPGMLGLFLGSIFNGALSFLVSNMNSLSTVTWEDFVSEAPAFKGISNKQQLTVIKIIGIIYALTIMSLSLCVGLVGGVIEGSLLVTSATSGALLGVFLLAAIFPVANGKGALCGMIGSHFITSWMAIGRILYVNDKKSMLPLSVQKCPNSTLSVLAHKPLLVSQNSTIAQIVNVLDKHVYNKGKDQSPIFAALMRMYSVSYMWYAVIGTVTCIILGVTIGLLTASESDAFDEKLLHPLVAKITRKMPGKKRTFTNIIKEKVVEVQEVEKIEETTLEVKSVLSSNGSRLFDAYDNNLQSRTRL-