Monarch geneset OGS2.0

DPOGS204371
TranscriptDPOGS204371-TA1086 bp
ProteinDPOGS204371-PA361 aa
Genomic positionDPSCF300040 + 928898-932981
RNAseq coverage15x (Rank: top 81%)
Annotation
HeliconiusHMEL0118202e-15197.79% 
BombyxBGIBMGA005893-TA5e-12780.33% 
Drosophilabru-3-PA2e-9262.04% 
EBI UniRef50UniRef50_Q9VU912e-9062.04%Bruno-3, isoform A n=151 Tax=cellular organisms RepID=Q9VU91_DROME
NCBI RefSeqXP_971057.26e-11578.38%PREDICTED: similar to bruno-3 CG12478-PA [Tribolium castaneum]
NCBI nr blastpgi|2700168065e-12281.58%hypothetical protein TcasGA2_TC001522 [Tribolium castaneum]
NCBI nr blastxgi|2700168062e-13081.58%hypothetical protein TcasGA2_TC001522 [Tribolium castaneum]
Group
Gene OntologyGO:00001668.2e-24nucleotide binding
GO:00036765.2e-20nucleic acid binding
KEGG pathway 
InterPro domain[265-341] IPR0126778.2e-24Nucleotide-binding, alpha-beta plait
[270-343] IPR0005045.2e-20RNA recognition motif domain
Orthology groupMCL11648 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204371-TA
ATGAATGCGCCCGACTTTAGTTTCTGTATTGTAGTTCCCTTCACCTGCCTCCGGGCCGTCGCTTGCATTTTTAAGGACGCTTGGGCCTCATTTTCTGAGGCGTTGTATGCTTTGCTAATGTTCACGTTTCCCTCACTGCCTATTAAAAGTGGAGCCTCATCGAGCCTGGTAGTAAAATTCGCGGACACGGAAAAGGAACGTCAACTTCGTCGCATGCAGCAGATGGCCGGCAACATGAGCCTGCTGAACCCGTTCGTTTTCAATCAGTTCGGAGCTTATGGAACCTACGCTCAGGTCATCACCGAGCAAGTTGACCTACAGCAACAAGCGGCATTGATGGTGGCCGCCACCGCCCAAGGCTATATCAGTCCTATGACAGCGCTCGCGTCCCACGCTCTAAACGGAATGGCCAATTCCGTAGTGCCAGCTACCTCTGATAACTTCACCGGGCTCGCGATAGGCACAGGAGGTGGACAGCCACTGAATGGAGCGCTTCCATCTCTGCCATCGCCAACGATGCCAGGCTTTAACATGGCAGCTCAGACAAATGGGCAGCCACCACCGCAAGAGGCGGTATATACTAATGGCATCCACCAGACATTTACTGGACCCGTGCCAGTGACAGCGCAGGGCATCCCTAACGGCGAAGCAGCGCTGCAACACGCTGCTTACCCCAGCATGCAGCCCTTCCCCGGCGTCGCTTATCCAGCCGTTTATGGGCAGTTTCCGCAGCCCATCCCGCCGCCGATGTCGACAATAGCGCCAGCGCAAAGAGAAGGATGCTCCATTTCGGGGCCTGAGGGCTGTAACCTGTTCATATACCACTTGCCACAAGAATTCGGGGACGCCGAACTGATGCAGATGTTCCTCCCTTTCGGGAATGTAATAAGCAGCAAGGTGTTCATTGACCGTGCCACCAATCAGAGCAAATGTTTCGGCTTTGTATCGTTTGACAACCCGACGTCAGCCCAGGCCGCCATTCAAGCAATGAATGGCTTCCAGATCGGCATGAAGCGGCTAAAGAAGGCTGCCGGCAAGTACCCTTCGCCCGCGGTGCGCTTCTTTGCGTATTATCAAGAATTCTAA

Protein sequence:

>DPOGS204371-PA
MNAPDFSFCIVVPFTCLRAVACIFKDAWASFSEALYALLMFTFPSLPIKSGASSSLVVKFADTEKERQLRRMQQMAGNMSLLNPFVFNQFGAYGTYAQVITEQVDLQQQAALMVAATAQGYISPMTALASHALNGMANSVVPATSDNFTGLAIGTGGGQPLNGALPSLPSPTMPGFNMAAQTNGQPPPQEAVYTNGIHQTFTGPVPVTAQGIPNGEAALQHAAYPSMQPFPGVAYPAVYGQFPQPIPPPMSTIAPAQREGCSISGPEGCNLFIYHLPQEFGDAELMQMFLPFGNVISSKVFIDRATNQSKCFGFVSFDNPTSAQAAIQAMNGFQIGMKRLKKAAGKYPSPAVRFFAYYQEF-