Monarch geneset OGS2.0

DPOGS204743
Transcript        DPOGS204743-TA   1101 bp
Protein           DPOGS204743-PA   366 aa
Genomic position  DPSCF300231 - 562986-568448
RNAseq coverage   132x (Rank: top 56%)
Annotation
Heliconius      HMEL017688        2e-124  63.99%
Bombyx          BGIBMGA013703-TA  4e-107  77.82%
Drosophila      CG1789-PA         1e-50   45.12%
EBI UniRef50    UniRef50_D6WCA4   2e-61   52.80%  Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WCA4_TRICA
NCBI RefSeq     NP_001040447.1    1e-105  77.82%  UTP11-like U3 small nucleolar ribonucleoprotein [Bombyx mori]
NCBI nr blastp  gi|114052022      3e-104  77.82%  UTP11-like U3 small nucleolar ribonucleoprotein [Bombyx mori]
NCBI nr blastx  gi|114052022      1e-110  77.82%  UTP11-like U3 small nucleolar ribonucleoprotein [Bombyx mori]
Group
Gene Ontology    GO:0032040  1.4e-102  small-subunit processome
                 GO:0006364  1.4e-102  rRNA processing
KEGG pathway
InterPro domain  [118-366]  IPR007144  1.4e-102  Small-subunit processome, Utp11
Orthology group  MCL13105  Single-copy universal gene

Nucleotide sequence:

>DPOGS204743-TA
ATGTCGTCTTGGAAGAAAGCGGCAAAAGCGAATCAGAAAACACATAAGGAAAGGCATCAGCCAGAAGCTCGGAAACACTTAGGACTACTAGAGAAGAAGAAAGATTATAAGAAACGAGCCGAGGATTACCATGAAAAGGGAGCAACGCTCAAACTTTTACGTAAACGAACCTTGGATAAAAACCCCGATGAGTTCTACTTTCACATGATTAATTCAAGAGTAAAGGATGGGGAACATCATGAATTAGAAAAAGAGGATGAAGTAACTTCCGAGCAGGTGAAATTGATGCAAACACAAGATCTTAAATATATCAATATGAAGAGAACAATTGAGAGCAGAAGAATACAGAGGATGCAGAAAGCGGCAAAAGCGAATCAGAAAACACATAAGGAAAGGCATCAGCCAGAAGCTCGGAAACACTTAGGACTACTAGAGAAGAAGAAAGATTATAAGAAACGAGCCGAGGATTACCATGAAAAGGGAGCAACGCTCAAACTTTTACGTAAACGAACCTTGGATAAAAACCCCGATGAGTTCTACTTTCACATGATTAATTCAAGAGTAAAGGATGGGGAACATCATGAATTAGAAAAAGAGGATGAAGTAACTTCCGAGCAGGTGAAATTGATGCAAACACAAGATCTTAAATATATCAATATGAAGAGAACAATTGAGAGCAGAAGAATACAGAGGATGCAGTCTCAACTCCACATGACGAACGTAGCGGACTCTACGCCCAACACTCACGTGTTCTTTGTAGAAGAAGACGAGGCAAAGAACTTTGATTTAGCCAAAAGACTAGACACACATCCCTCACTCATCAACAGGAAGTCCAACAGACCACGGCTGTCGGATCTGGATAAGATTACTCTTCCTGAAGTTAACGATGAGTTTGTCAATTCAACTAAGAAAATGAAAGAGCGTACATACAAAGAACTGTCAAAGAGAATAGACAGGGAAAAACATCTGACGGTAGTCCAACAAAAACTGGAGATCAAGAGACATTTACAAGACGCTAAAGTTATGAAGCCAAAGAGAGTCAAGCGAGGCACGGCCGTCTCAGCGCCCGTCTACAAGTTCCAGTACATGAGAAAAAAATAA

Protein sequence:

>DPOGS204743-PA
MSSWKKAAKANQKTHKERHQPEARKHLGLLEKKKDYKKRAEDYHEKGATLKLLRKRTLDKNPDEFYFHMINSRVKDGEHHELEKEDEVTSEQVKLMQTQDLKYINMKRTIESRRIQRMQKAAKANQKTHKERHQPEARKHLGLLEKKKDYKKRAEDYHEKGATLKLLRKRTLDKNPDEFYFHMINSRVKDGEHHELEKEDEVTSEQVKLMQTQDLKYINMKRTIESRRIQRMQSQLHMTNVADSTPNTHVFFVEEDEAKNFDLAKRLDTHPSLINRKSNRPRLSDLDKITLPEVNDEFVNSTKKMKERTYKELSKRIDREKHLTVVQQKLEIKRHLQDAKVMKPKRVKRGTAVSAPVYKFQYMRKK-