Monarch geneset OGS2.0

DPOGS215090
TranscriptDPOGS215090-TA1293 bp
ProteinDPOGS215090-PA430 aa
Genomic positionDPSCF300187 + 242241-243533
RNAseq coverage721x (Rank: top 18%)
Annotation
HeliconiusHMEL0105370.075.63% 
BombyxBGIBMGA007192-TA9e-16768.11% 
DrosophilaCG2972-PA1e-9040.69% 
EBI UniRef50UniRef50_E2APE06e-10947.55%RNA-binding protein NOB1 n=6 Tax=Formicidae RepID=E2APE0_CAMFO
NCBI RefSeqXP_974124.13e-11251.90%PREDICTED: similar to CG2972 CG2972-PA [Tribolium castaneum]
NCBI nr blastpgi|910832576e-11151.90%PREDICTED: similar to CG2972 CG2972-PA [Tribolium castaneum]
NCBI nr blastxgi|3800238531e-10848.08%PREDICTED: RNA-binding protein NOB1-like [Apis florea]
Group
KEGG pathway 
InterPro domain[284-355] IPR0148812.1e-27Nin one binding (NOB1) Zn-ribbon-like
Orthology groupMCL13707 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215090-TA
ATGTCTAAGAAAATAAAACATTTAGTCGTAGATACGACTGCTTTCATTCGTGCTGGTAATCTTCAAGATATTGCAAATGAAATATACACAATTCAAGAGGTAGTAGACGAAATTACTAACGACCGGCAGCGTAAAAAGCTCGTAGTACTTCCATATGATTTGAAAATAAAAGATGTATTTACTGAAAACATCAAGTTCGTTACGGAATTCTCAAAAAAGACTGGCGATTATACCAGTCTATCAGCGACCGACATAAAAGTTATGGCTCTTACATACCAAATGGAAAAAGAGTTTCTTGGTACGAATCATTTAAAATCAGAACCTACTATGCAAAGGACTGTTAAAGTCAGTGGTCTCCCTGCATTTAACAATAACACAGAAGATAAAGAACACGTATCTGAAACAAATGACCAAAAAAGTGATGGTGATAGTTCAGGGGACGTCGAAGAATACATCACTGATGACAGGAAGCAAACAAATGAAACAAATACTGAAAATAATCACGATATTTTGAAAGATGGGGATGAAAGTGACGATGACAATAAATTAGCAGAGGAGATCGCTGAGCAGATAAAAAATATGGATTTAAGAGACGAATCGGAAGTCAATGATTGTATTGTAAAAGTATCAGACGAAGAATACAGCGACAGTCAAGAAACTGAGGACGATTCCGACAGTGATGATGGAGACTGGATAACACCGGGGAACCTTAAAGAGAAGAAGAAAGAAGTGGAAGATGGAGAATTTGAGGATAAAAATGTGGAAGTTGCATGCATAACATCAGATTTTGCCATGCAGAATGTACTGAAACAAATAGGATTGAATGTGACATCTATTGATGGTCGAGTGATAAAATATCTTAAAACATTTATTTTCCGCTGCACCACATGTTTCAAGACAACAAGCATTATGACTAAAGTGTTCTGTCCCAAATGTGGCCATGCTACCCTTAAGAAGGTCTCTGTAAGTGTGGATGATGACGGCAATCAACATATACATATAAACGGAAGAAAACCCCTCACAGCCAGGGGTAAGAGGTTCAGTTTACCGACGCCTAGAGGTGGGCAACATTTCCAGTACCCTATACTAACAGAGGATCAACACATACACAAAACATTTGCCACAAAATTGGCTCGTAGCAAAACAAATGCTCTAGACCCCGACTACATCGCTGGATTTTCACCATTTGTTATGAAAGATGTTAACTCAAAATCTGCAGTCCTGGGTGTGAGGGCTAACAAACAGGATATCAAATATTGGATGAAGCATTACTCTAAAGGGAAGAAAAAGTAA

Protein sequence:

>DPOGS215090-PA
MSKKIKHLVVDTTAFIRAGNLQDIANEIYTIQEVVDEITNDRQRKKLVVLPYDLKIKDVFTENIKFVTEFSKKTGDYTSLSATDIKVMALTYQMEKEFLGTNHLKSEPTMQRTVKVSGLPAFNNNTEDKEHVSETNDQKSDGDSSGDVEEYITDDRKQTNETNTENNHDILKDGDESDDDNKLAEEIAEQIKNMDLRDESEVNDCIVKVSDEEYSDSQETEDDSDSDDGDWITPGNLKEKKKEVEDGEFEDKNVEVACITSDFAMQNVLKQIGLNVTSIDGRVIKYLKTFIFRCTTCFKTTSIMTKVFCPKCGHATLKKVSVSVDDDGNQHIHINGRKPLTARGKRFSLPTPRGGQHFQYPILTEDQHIHKTFATKLARSKTNALDPDYIAGFSPFVMKDVNSKSAVLGVRANKQDIKYWMKHYSKGKKK-