Monarch geneset OGS2.0

DPOGS212387
Transcript: DPOGS212387-TA  663 bp
Protein: DPOGS212387-PA  220 aa
Genomic position: DPSCF300019 + 820368-821421
RNAseq coverage: 426x (Rank: top 29%)
Annotation
Heliconius: HMEL013385  3e-112  94.09%
Bombyx: BGIBMGA012118-TA  5e-99  88.84%
Drosophila: snf-PA  5e-80  75.34%
EBI UniRef50: UniRef50_P08579  6e-80  65.79%  U2 small nuclear ribonucleoprotein B'' n=93 Tax=Eukaryota RepID=RU2B_HUMAN
NCBI RefSeq: NP_001161808.1  1e-95  76.62%  U1 small nuclear ribonucleoprotein A [Apis mellifera]
NCBI nr blastp: gi|380024353  3e-95  77.06%  PREDICTED: U2 small nuclear ribonucleoprotein B''-like [Apis florea]
NCBI nr blastx: gi|91078752  2e-97  83.48%  PREDICTED: similar to U1 small nuclear ribonucleoprotein A [Tribolium castaneum]
Group
Gene Ontology: GO:0000166  8.9e-24  nucleotide binding
               GO:0003676  1.6e-14  nucleic acid binding
KEGG pathway: tgu:100217561  2e-80
              K11094 (SNRPB2) maps-> Spliceosome
InterPro domain: [4-89] IPR012677  8.9e-24  Nucleotide-binding, alpha-beta plait
                 [8-82] IPR000504  1.6e-14  RNA recognition motif domain
Orthology group: MCL12124  Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212387-TA
ATGGACATACGGCCGAACCACACAATATATATAAACAATCTCAATGAAAAAATTAAGAAGGAGGAACTTAAAAAGTCCCTCTATGCTATATTTTCTCAATTTGGGCAGATTTTAGAAATCGTTGCACTGAAAACACTCAAGATGAGGGGTCAAGCATTTGTTATATTCAAAGAAATATCCAGTGCAACCGTCGCTTTGAGGAGTATGCAAGGATTCCCATTTTATGACAAACCTATGAGAATACAGTATTGTAAAACTGATAGTGATGTTATAGCAAAGATGAAGGGAACCTTCCAAGAACGACCTAAAAGACCAAAACTGCCAAAAGGTTCCGATGAGAAGAAGAAAAAGAACAAAGATCCCAATAGACCAAATGTGCCTGGATTTGGACAACCAAGTGTTCTTAACAATGTTAACGCTGAACAGCCACCAAACCAAATACTGTTTCTTACTAATCTTCCTGATGAAACATCAGAAATGATGTTATCTATGTTGTTTAACCAGTTCCCTGGTTTTAAGGAAGTTCGACTGGTGCCAAACAGACATGATATCGCCTTTGTGGAGTTTGCTAATGAAATGCAATCAGCGGCTGCTAAAGAAGCCCTCCAAGGCTTCAAGATCACTCCCACTCATGCTATGAAAATATCATTTGCAAAGAAATAG

Protein sequence:

>DPOGS212387-PA
MDIRPNHTIYINNLNEKIKKEELKKSLYAIFSQFGQILEIVALKTLKMRGQAFVIFKEISSATVALRSMQGFPFYDKPMRIQYCKTDSDVIAKMKGTFQERPKRPKLPKGSDEKKKKNKDPNRPNVPGFGQPSVLNNVNAEQPPNQILFLTNLPDETSEMMLSMLFNQFPGFKEVRLVPNRHDIAFVEFANEMQSAAAKEALQGFKITPTHAMKISFAKK-
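
The transcript and protein entries in this record can be cross-checked against each other: a 220 aa protein plus one stop codon corresponds to 3 * 220 + 3 = 663 bp, and translating the nucleotide sequence should reproduce the protein sequence. The following is a minimal sketch of that check, assuming the standard genetic code and the record's apparent convention of writing the stop codon as "-". The function name `translate` and its `stop_char` parameter are illustrative and are not part of the database.

```python
# Standard genetic code (NCBI translation table 1), with codons ordered
# by the bases T, C, A, G at each of the three positions.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)


def translate(cds, stop_char="-"):
    """Translate a coding sequence in frame 1.

    Stop codons are written as `stop_char` (assumed here to be "-",
    matching the trailing character of the protein sequence above).
    Raises ValueError for lengths not divisible by 3 or for any
    character other than A, C, G, T.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        b1, b2, b3 = (BASES.index(base) for base in cds[i:i + 3])
        aa = AMINO[16 * b1 + 4 * b2 + b3]
        protein.append(stop_char if aa == "*" else aa)
    return "".join(protein)


# First 30 nt and last 12 nt of DPOGS212387-TA, copied from the record.
head = "ATGGACATACGGCCGAACCACACAATATAT"
tail = "GCAAAGAAATAG"

print(translate(head))  # MDIRPNHTIY
print(translate(tail))  # AKK-
```

Passing the full DPOGS212387-TA sequence to `translate` and comparing the result with DPOGS212387-PA extends the same check to the whole record.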