Monarch geneset OGS2.0

DPOGS200156
Transcript          DPOGS200156-TA    1278 bp
Protein             DPOGS200156-PA    425 aa
Genomic position    DPSCF300128 + 18672-41066
RNAseq coverage     47x (Rank: top 71%)
Annotation
Heliconius        HMEL014416          2e-82     78.70%
Bombyx            BGIBMGA001619-TA    2e-102    84.81%
Drosophila        CG34354-PA          1e-92     81.15%
EBI UniRef50      UniRef50_A8JPX0     1e-90     81.15%    CG34354, isoform A n=31 Tax=Neoptera RepID=A8JPX0_DROME
NCBI RefSeq       XP_624017.1         1e-101    92.06%    PREDICTED: similar to CG12870-PA isoform 1 [Apis mellifera]
NCBI nr blastp    gi|345497985        5e-101    89.23%    PREDICTED: nucleolysin TIA-1 isoform p40-like [Nasonia vitripennis]
NCBI nr blastx    gi|345497985        8e-99     90.16%    PREDICTED: nucleolysin TIA-1 isoform p40-like [Nasonia vitripennis]
Group
Gene Ontology     GO:0000166    3.7e-26    nucleotide binding
                  GO:0003676    1.2e-23    nucleic acid binding
KEGG pathway 
InterPro domain   [20-107]    IPR012677    3.7e-26    Nucleotide-binding, alpha-beta plait
                  [24-97]     IPR000504    1.2e-23    RNA recognition motif domain
Orthology group   MCL15783    Insect specific

Nucleotide sequence:

>DPOGS200156-TA
ATGATACTCTCGGAGCATGGTGAGGATGTATGGGATTGTACTGGAGGAGTAAGAGCGAGAGAACACTACCACATCTTCGTCGGTGATCTCAGCCCGGAGATTGAGACGCAGAATCTCAGAGACGCCTTCGCCCCATTCGGCGAAATATCGGATTGTCGCGTCGTTCGTGATCCTCAGACGCTCAAATCTAAGGGATACGGTTTCGTGTCCTTCCTTAAGAAATCCGAAGCGGAGTCAGCTATAACGGCTATGAACGGCCAGTGGTTGGGGTCGAGGTCTATACGAACTAACTGGGCGACAAGGAAACCACCAGCTCCAAAAAACGAACTAAACTCAAAGCCGCTAACCTTCGACGAGGTTTACAACCAGAGCTCCCCGACCAATTGCACGGTCTACTGCGGCGGTCTTACGGCCGGGCTCACCGAGGAGCTCATGCAGAAGACCTTCCAGCCCTTCGGGACCATCCAGGAGATACGCGTCTTCAAGGATAAGGGATACGCTTTTATCAGATTCTCAACCAAAGAGAGCGCGACCCATGCTATAGTTGCTGTGCACAATGCTGATGTTAACGGCGCTCCTGTGAAGTGTTCCTGGGGCAAGGAATCCGGTGACCCGAATAACGCACAAGGAGCACAGCCGCTAACCTTCGACGAGGTTTACAACCAGAGCTCCCCGACCAATTGCACGGTCTACTGCGGCGGTCTTACGGCCGGGCTCACCGAGGAGCTCATGCAGAAGACCTTCCAGCCCTTCGGGACCATCCAGGAGATACGCGTCTTCAAGGATAAGGGATACGCTTTTATCAGATTCTCAACCAAAGAGAGCGCGACCCATGCTATAGTTGCTGTGCACAATGCTGATGTTAACGGCGCTCCTGTGAAGTGTTCCTGGGGCAAGGAATCCGGTGACCCGAATAACGCACAAGGAGCACAGTTGGGTGGCACTGCCTATAGTCCTTTCGGCGCCTATCCAGGAGGTGTTCCACCTTCATATTGGTATAACACATACCCGCAGCAACTGGGAGGTTTCCTGCAAGGAGTCCAGGGAGTGCAAGGATACTCCTATGCCGGACAATTCGCGGGCTATCAGCAACAATACATGGGCATGGGCGGCGTACAATTGCCATGGGCGCTGGGCGGAGGCGTGGGCGGTGTCGGTGGCGTCGGTGGTGTTGCTGCAATGTCTCAGCCGCCTCAAGTGCTGCACTATCCCGTACAGCACTTTCAAGTCCAGCCCATCGGTGAAGACGAGTGGCTGGCGCCGAGCCTGCTGGTGTGA

Protein sequence:

>DPOGS200156-PA
MILSEHGEDVWDCTGGVRAREHYHIFVGDLSPEIETQNLRDAFAPFGEISDCRVVRDPQTLKSKGYGFVSFLKKSEAESAITAMNGQWLGSRSIRTNWATRKPPAPKNELNSKPLTFDEVYNQSSPTNCTVYCGGLTAGLTEELMQKTFQPFGTIQEIRVFKDKGYAFIRFSTKESATHAIVAVHNADVNGAPVKCSWGKESGDPNNAQGAQPLTFDEVYNQSSPTNCTVYCGGLTAGLTEELMQKTFQPFGTIQEIRVFKDKGYAFIRFSTKESATHAIVAVHNADVNGAPVKCSWGKESGDPNNAQGAQLGGTAYSPFGAYPGGVPPSYWYNTYPQQLGGFLQGVQGVQGYSYAGQFAGYQQQYMGMGGVQLPWALGGGVGGVGGVGGVAAMSQPPQVLHYPVQHFQVQPIGEDEWLAPSLLV-
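
Consistency check (illustrative sketch, not part of the original record): the lengths listed above agree, since 425 codons plus one stop codon give 426 x 3 = 1278 bp. The Python snippet below shows one way the transcript could be translated and compared against the protein entry. It assumes the standard genetic code (NCBI translation table 1) and assumes that the trailing "-" in the protein sequence marks the stop codon; the function name `translate` and the short test fragments are chosen here for illustration only.

```python
# Minimal sketch: translate a coding sequence with the standard genetic code.
# Assumption: the record uses "-" for the stop codon, so that symbol is the default.

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code, "*" = stop).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate a CDS codon by codon; incomplete trailing codons are ignored."""
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE.get(cds[pos:pos + 3], "X")  # X = unknown codon
        protein.append(stop_symbol if residue == "*" else residue)
    return "".join(protein)


# First 30 nt and last 12 nt of DPOGS200156-TA, copied from the record above.
print(translate("ATGATACTCTCGGAGCATGGTGAGGATGTA"))  # MILSEHGEDV
print(translate("CTGCTGGTGTGA"))                    # LLV-
print((425 + 1) * 3)                                # 1278
```

Running `translate` on the full DPOGS200156-TA sequence and comparing the result with DPOGS200156-PA would be the complete version of this check.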