Monarch geneset OGS2.0

DPOGS206206
TranscriptDPOGS206206-TA1179 bp
ProteinDPOGS206206-PA392 aa
Genomic positionDPSCF300405 - 68747-74006
RNAseq coverage2228x (Rank: top 5%)
Annotation
HeliconiusHMEL0225743e-3880.00% 
BombyxBGIBMGA011988-TA4e-9482.65% 
DrosophilasnRNP-U1-70K-PA4e-7471.06% 
EBI UniRef50UniRef50_P171336e-7271.06%U1 small nuclear ribonucleoprotein 70 kDa n=15 Tax=Bilateria RepID=RU17_DROME
NCBI RefSeqXP_001947021.18e-8455.87%PREDICTED: similar to AGAP006755-PA [Acyrthosiphon pisum]
NCBI nr blastpgi|1936572471e-8255.87%PREDICTED: u1 small nuclear ribonucleoprotein 70 kDa-like [Acyrthosiphon pisum]
NCBI nr blastxgi|910789082e-13763.48%PREDICTED: similar to AGAP006755-PA isoform 1 [Tribolium castaneum]
Group
Gene OntologyGO:00001661.4e-18nucleotide binding
GO:00036761.1e-14nucleic acid binding
KEGG pathwayapi:1001678506e-79 
 K11093 (SNRP70)maps-> Spliceosome
InterPro domain[1-93] IPR0220238.3e-30U1 small nuclear ribonucleoprotein of 70kDa MW N-terminal
[98-217] IPR0126771.4e-18Nucleotide-binding, alpha-beta plait
[104-161] IPR0005041.1e-14RNA recognition motif domain
Orthology groupMCL11804 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206206-TA
ATGACGCAGTTTTTACCTCCAAATCTCCTCGCGTTATTCGCGGCGAGGGATCCTATACCATATTTGCCGCCGGCAGCGAAACTCCCACACGAGAAAAAGCAGAAAGGCTACGATGGTGTCGGCGCCTTTCTAAATGTCTTTGAGCATCCGTCAGAGACTCCGCCCCCCACTCGTGTGGAGACGCGCGAGGAGCGCCTGGAGCGCCGGCGACGAGAACGAGCTGAACAGACGGCATACAAGTTAGAACAGGAAATAGCTTTGTGGGATCCAACCACCAATGCCAAGGCTACAGGGGATCCATTCAAAACTCTATTTGTTGCCCGTGTTAATTACGATACCTCTGAATCCAAGCTACGTAGAGAGTTTGAGGGCTATGGACCAATTAAAAAAATCCACATGGTTTACAGTAAGGAGAACGGTAAGCCACGCGGGTATGCCTTCATTGAGTACGAACATGAGAGGGACATGCACTCCGCCCAGGCGGCCAGTGCGAGTGCGCTCCTCCAAGAAGGTCGTAGCCAGAGACCGCGTTCGGCCATCGTTACGGCTTACAAACACGCCGACGGCAAGAAGATCGACGGGAAGCGGGTGTTGGTGGACGTGGAGAGGGCTAGGACGGTGAAGGGTTGGCTGCCGAGGAGACTGGGCGGGGGTCTAGGAGGGACGCGCCGAGGCGGAGCTGACGTGAACATCAAGCACTCGGGGCGAGAAGACAACGAGCGGGAGAGGGAGAGGTACAGGCTGGAGCAGAGAGACAGAGACAGAGACGAGAGGAGGGACAGGGAGCGTGTCCGTCGCGGTCGCAGCCGTTCCAGGTCCCGCCGCCGATCAGGCTCCAGACCGAGGAGGAGAGAGAGGGAGAGAGAGAAGAGAGAGGAAGATGTGGACAGGACAAGGCGCAGGAGGTCCCGGTCACGTTCGGAGCGGCGGCGGGAACGACGGGACAGGGATAAGGATAGAGAAAAGGACAGGGACAAGGATAGGGATAAGAGGCGGAGGAGGGATCGGGACAGGAAGGAGACGAAGATCAAGCCCGAGGATATCAAGATTAAGGAGGAGCCTCAGGATGACTATCCCGAGTTCAGCCTGCCGCCCCTGGATGTCCAGATCAAGCAGGAGCCGGAAGACGAGAACAAGTATAACCCAGAAGATGCCAACGGTGATGATTATAACTATTGA

Protein sequence:

>DPOGS206206-PA
MTQFLPPNLLALFAARDPIPYLPPAAKLPHEKKQKGYDGVGAFLNVFEHPSETPPPTRVETREERLERRRRERAEQTAYKLEQEIALWDPTTNAKATGDPFKTLFVARVNYDTSESKLRREFEGYGPIKKIHMVYSKENGKPRGYAFIEYEHERDMHSAQAASASALLQEGRSQRPRSAIVTAYKHADGKKIDGKRVLVDVERARTVKGWLPRRLGGGLGGTRRGGADVNIKHSGREDNERERERYRLEQRDRDRDERRDRERVRRGRSRSRSRRRSGSRPRRREREREKREEDVDRTRRRRSRSRSERRRERRDRDKDREKDRDKDRDKRRRRDRDRKETKIKPEDIKIKEEPQDDYPEFSLPPLDVQIKQEPEDENKYNPEDANGDDYNY-