Monarch geneset OGS2.0

DPOGS212276
TranscriptDPOGS212276-TA2460 bp
ProteinDPOGS212276-PA819 aa
Genomic positionDPSCF300077 + 33915-43038
RNAseq coverage205x (Rank: top 47%)
Annotation
HeliconiusHMEL0066560.074.22% 
BombyxBGIBMGA011437-TA1e-13763.29% 
DrosophilaCG8273-PA2e-7436.27% 
EBI UniRef50UniRef50_E2AX882e-9746.47%SON protein n=4 Tax=Camponotus floridanus RepID=E2AX88_CAMFO
NCBI RefSeqXP_396370.32e-10550.45%PREDICTED: similar to CG8273-PA [Apis mellifera]
NCBI nr blastpgi|910937893e-9749.43%PREDICTED: similar to SON DNA-binding protein [Tribolium castaneum]
NCBI nr blastxgi|3071691553e-12338.10%SON protein [Camponotus floridanus]
Group
Gene OntologyGO:00056221.3e-19intracellular
GO:00037251.3e-19double-stranded RNA binding
GO:00037233.9e-16RNA binding
GO:00036766.3e-10nucleic acid binding
KEGG pathway 
InterPro domain[748-816] IPR0011591.3e-19Double-stranded RNA-binding
[747-816] IPR0147203.9e-16Double-stranded RNA-binding-like
[674-714] IPR0004676.3e-10D111/G-patch
Orthology groupMCL12538 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212276-TA
ATGACTAGTTTAATAGATAAGATTGAAATGTTGATAAAGCGAGAGAAAACAACACCTGACCGCAAAAAAGACGATACAACAAAGTCATCAAGTGAAATACTCTCTGAACTTTTTAGTGCTTTTAATGCGGACCCTCCGAAAATTGATGATATTGCCTTTAAAAAGGCCAAGAAAAGCAAGAAAAAACATAAAAAGGAAAAGAAAAAACGCAGTCGAAGTGCCAGTGTTAGTAGTGATAGTGATTACAGTAGGAAACGCAAAAAGCGAAAAAAGAGTAAGTCTAAGAAACGTAAGGAGGATCGCTCACCCTCACGTCGTTATTCAAAATCACCTATATCTAAAAGGGAGTTAAAGGTAAAGATTAAAAACGAACTGGATAAGATAAATCAAGAAGCACCGGTCAAAGTAAAAGAAGAATTGAAAATCAAAACGGAACCGGTTAAGACAGAGGTGATTAAGGATGATACGAACGACTCTTGTATAGTACTTGATGATGACATTGATGCAAGTGAAATCCCCATGCCGGAATCTCCGATCGATAAGCTTAAATCAGATCTCAGGTCGAAACTAGATAAAAAACAAGAAAGTGTAACAGACAATGACAAAACAAAAAGTAAAATACAAATTAAGAATCTGAAATTTAGTACAGTTTTCGAAGAGACTGTGAAAAAGGCTGAAGAAGAAGCTAAGAAGAAAAAAGAGAAGTTGGAAGAGGGGGAATATACAGATTCCTCAAGTAGTTCAGACGGGGATGAAGTATCTAACTCACAGTTCCCGAAGTCATCAGATCTAGTCGGAATATTAAGGCAACAGGCTGCAGAAGAAACTAAATTAATTGAAGAGCAAAAAACTGTGAATAGAGTGAAAACAACAGAAAAATCACATAAAACTTCACAAAGAGATAGGAGTAGATCAAGAAAGCGAATTATATTTGAAGGTCAACCAGTAGAAAACGTTCCAAATCCCGTTCAACGTCACGGCACAGAAGACGACACAGAAGTCGGTCCAAGTCCCATCACAGAACCCGGTCGAGATCCAGACGGAGTCATTCGAGACATCGATCCAGATCCAGAAGATCCAGATCGAGACACCGATCCTCATCACGATCCAGACATCGCTCGTCATCAAAATCCAGACACCGGTCCCGACACTCTTCCAGACATCGGCGGAGGTCTAGGAGTCCGTCTAGCCGATAGTGAGAAGAAGCGTCTGTTAGAGGTAGCTCGACGGAACGCCATTAACATGTTGAAGAACGGAGCTGTACCGGCTGGAGCCGCCGCCCTCCCGCCCCATACACGGAGTCAGGTCATGGCCGCTATACAGTCTGGAGGTAAATCGGTTGATGAACTGACAGATTTCTGTAAGCATTTATCAAAAAAGGAAGCTCTGGGTGAACTATCATCTGTGTCATCCAATGATGAAGATATGTCGGAGAATGAAGACACAATAGCCTTCCATCATCCGTTCCTGGTGAAGGAAAAAGCGCCCATCATTATGAATATAAGGGGTGGAGCTCCACTGCCCACAAAAACCAGCATCCCTGTGGCAAACAAAGACGAACTCCGTCTACAGTTCCCTGTCTCATCGGGTACACAGCACAGAGAGAAAGCTGATACAACGGACAATGAAGATATGCAACTGGCGGTCATAAACACGAAACCATCGAGTCCATTGGCTAAAACACTGGAACCGATCGCTCTTCCCGCTCCCAAACCCGACTATTTAAAAGTTTTCACAGGTGGAAACAGTCTCAGCACTGTTCCTGCATTGCCAGCCGCTACATCAACTATAGACAAGGCTAAGGATGTTGCATCAATAGTGTCAGAGAAGTTGTCTTTAATAAGAAAAGAACAAGAGAATTATGAAGTTACGCCTACACATGGGTTCGGCTTCAAGAGTTCATCCCTGGGACAATTCACGGGGTCCACTGGAGCTCACATACTTACCCCACGAGAACTAGCCAGCGGGGCACAAGCCTGGGCTAAGAAGGATCAGCTGGTACGAGCTGCTCCCGTTGAAGGTGGGATGGGAATGCATCTGCTACAGAAAATGGGTTGGACTCCCGGTCGGGGGTTAGGCAAAGAGGGCACTGGGGCGTTACAACCCCTATTGCTAGAGGTGAAGCTGGACACGAGGGGTCTAACATCTAAGGAGGAGGTCAGTAACAGGCGTGGAAGACACGTGAAACCGCAGCGTATAGGTCGTCGTGGCCCGGCGCCGCTCGTGGCTGGAGGGAAGCACCCTGTATCACTTCTCGGTGAATACTGTTCCAAACAGAAACTAGGACCTCCTGAGTACAGTCTTTGCTTCGAATGTGGACCAGATCATAAAAAGAATTTCCTATTCAAGGTAAAAGTAGCGGGTATAGAATATCAGCCGGCCGTAGCCAGTGCTAACAAGAAACAAGCTAAGGCCGACGCTGCTCAACTAGCGTTGCAAAAACTAGGAATTGTAACATAA

Protein sequence:

>DPOGS212276-PA
MTSLIDKIEMLIKREKTTPDRKKDDTTKSSSEILSELFSAFNADPPKIDDIAFKKAKKSKKKHKKEKKKRSRSASVSSDSDYSRKRKKRKKSKSKKRKEDRSPSRRYSKSPISKRELKVKIKNELDKINQEAPVKVKEELKIKTEPVKTEVIKDDTNDSCIVLDDDIDASEIPMPESPIDKLKSDLRSKLDKKQESVTDNDKTKSKIQIKNLKFSTVFEETVKKAEEEAKKKKEKLEEGEYTDSSSSSDGDEVSNSQFPKSSDLVGILRQQAAEETKLIEEQKTVNRVKTTEKSHKTSQRDRSRSRKRIIFEGQPVENVPNPVQRHGTEDDTEVGPSPITEPGRDPDGVIRDIDPDPEDPDRDTDPHHDPDIARHQNPDTGPDTLPDIGGGLGVRLADSEKKRLLEVARRNAINMLKNGAVPAGAAALPPHTRSQVMAAIQSGGKSVDELTDFCKHLSKKEALGELSSVSSNDEDMSENEDTIAFHHPFLVKEKAPIIMNIRGGAPLPTKTSIPVANKDELRLQFPVSSGTQHREKADTTDNEDMQLAVINTKPSSPLAKTLEPIALPAPKPDYLKVFTGGNSLSTVPALPAATSTIDKAKDVASIVSEKLSLIRKEQENYEVTPTHGFGFKSSSLGQFTGSTGAHILTPRELASGAQAWAKKDQLVRAAPVEGGMGMHLLQKMGWTPGRGLGKEGTGALQPLLLEVKLDTRGLTSKEEVSNRRGRHVKPQRIGRRGPAPLVAGGKHPVSLLGEYCSKQKLGPPEYSLCFECGPDHKKNFLFKVKVAGIEYQPAVASANKKQAKADAAQLALQKLGIVT-