Monarch geneset OGS2.0

DPOGS200680
Transcript: DPOGS200680-TA (1785 bp)
Protein: DPOGS200680-PA (594 aa)
Genomic position: DPSCF300353 - 142995-145101
RNAseq coverage: 179x (Rank: top 49%)
Annotation
Heliconius: HMEL017789; E-value 0.0; 61.14%
Bombyx: BGIBMGA005552-TA; E-value 0.0; 56.17%
Drosophila: CG12301-PA; E-value 7e-40; 35.37%
EBI UniRef50: UniRef50_D6WUM6; E-value 3e-106; 43.07%; Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WUM6_TRICA
NCBI RefSeq: XP_973079.1; E-value 6e-107; 43.07%; PREDICTED: similar to smooth muscle caldesmon, putative [Tribolium castaneum]
NCBI nr blastp: gi|91087967; E-value 1e-105; 43.07%; PREDICTED: similar to smooth muscle caldesmon, putative [Tribolium castaneum]
NCBI nr blastx: gi|91087967; E-value 7e-123; 42.52%; PREDICTED: similar to smooth muscle caldesmon, putative [Tribolium castaneum]
Group
Gene Ontology: GO:0032040; E-value 5.6e-108; small-subunit processome
Gene Ontology: GO:0006364; E-value 5.6e-108; rRNA processing
KEGG pathway: (none listed)
InterPro domain: [1-572] IPR006709; E-value 5.6e-108; Small-subunit processome, Utp14
Orthology group: MCL11406; Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200680-TA
ATGAGTTATGAAGAAATGGTTGAACACAGACAGCATCTTGCTAAGTTTAGAGCTCAACAGTCTTATAGGGCTGCCAAGGCTAAGAGACAAAGCAAAATTAAAAGTAAAAAATATCATCGTATACTTAAGAAAGAAAAATTAAAACAACAGTTAAAGGAATTTGAGGAATTACAAGCAACTAATCCAGAAGAGGCATTGAAAAAACTTGAGGAGTTAGAAAAAGCACGAGCTTTAGAAAGACATACCTTGAGACATAAAAACACTGGAAAATGGGCTAAGAACAAATTAGTTAGAGCAAAATATGATAAAGAGGTGAGGCAGCAATTAGCTGAACAACTGTCCGTAAGCAGAGGTCTTACACAGAAAACACAAAATGTTGAAAGTTCAGATGATGAGCCAGATGAAAGTGAAAACATCTGTGACATTCAAATATCACAGGATCCCATGAATCCATGGATGTTAAAGAAATCCGACAAGAGTAATATAGATGCCGAATTCAATTTCGGCTATAAAAAATATTTAAAAGACAAAATGTATAAATGCAAAGAGCAAAGTGACTCGGAAGAAGATGAAGCTGACCAAAATAGAGACAACGACATGAGTTCTCTGAAAATGCTGGCAGAGAGCTTGAAAAAATTAAACAATGGTGAAAGTACAAATGTGCTTGAAAACAAAACTCAAGACGATGTCACTTTAATGGATGTTACCAGTGAAGAAAATAAATCACTAACAGTTCTTAATAAAAATATTACACAGAAACAGAAAAGTAATAATACTGGCAGAAAAAACAAAAGAAAGATAGTCTCAACCTCAGACTGGCTTGTGGAAGAAATAAATCCCAAAAATGCAACAACTGAAGAAGATATAAATACTGCATTTGATGATTATGAAGACAAAGTAGCATTAAAGGTTGCTAAAAAACTTAAGGGTTTGAAACATGAGTTAAAAAATTTAGAATCATCATCTATCAAGCCAAATAAAAAGACAAATGAAACTAGTAAAGAAATTGACAACCTTGAATATTTAAAAATTAAAAAGCAGAAACAGATGCCTATAATTGATGAACCTCTTATAGAATCAAATAAAAATATTGATGACATACCCGAGCAGACAAAACACTTATTAGACACATTAAAAGATACAATAACAAGCACAAGCCAAAACGTAAACACAGATATTGATCCAAGTAGGTTTATTGAAGTTAAACCAAAGTATTTGAATACAGCTGTGACAAATTCCGAGAATAATTTCGACGACTTAGATGATGAAGAACAAGTGGTACCCAAAGTGGATATTGAAGAAGTTTTTGAAGAAGACGATGTGGTGACTAGTTTCAGACAAGAGAAGGAGGACGAAATTAATAAGAATAATCCAGAAGAACTAAGTTTAACACTCCCTGGATGGGGAGGCTGGGCCGGTAAAGGTGTGAAAGCACCTAAACGAAAGAAAAATAGATTTATTACAAAGAAACCACCAAAAACACTCAGAAGAGATGAAAACAAAGGTGATGTAATCATTAACGAGTCTAAAAATCCCAAGCTTGCTATACATAAAGTTTCAGATTTACCACATCCATTCAACAGTGTGAAAGAATATGAGGAAAGTATAAGAACGCCTCTAGGTAACACATTTGTGCCTGAAACAGCTCATAAGAAACTTATAAAACCTAATGTTATCACAAGATCTGGAACAATCATTGAACCGATGGATGAAGAAGAACTGCTTGTGCCAAGAAATCGTAACTTTAAAAATAAGTCTGTTATTAAGATTCTAGGCAAGCAATAA

Protein sequence:

>DPOGS200680-PA
MSYEEMVEHRQHLAKFRAQQSYRAAKAKRQSKIKSKKYHRILKKEKLKQQLKEFEELQATNPEEALKKLEELEKARALERHTLRHKNTGKWAKNKLVRAKYDKEVRQQLAEQLSVSRGLTQKTQNVESSDDEPDESENICDIQISQDPMNPWMLKKSDKSNIDAEFNFGYKKYLKDKMYKCKEQSDSEEDEADQNRDNDMSSLKMLAESLKKLNNGESTNVLENKTQDDVTLMDVTSEENKSLTVLNKNITQKQKSNNTGRKNKRKIVSTSDWLVEEINPKNATTEEDINTAFDDYEDKVALKVAKKLKGLKHELKNLESSSIKPNKKTNETSKEIDNLEYLKIKKQKQMPIIDEPLIESNKNIDDIPEQTKHLLDTLKDTITSTSQNVNTDIDPSRFIEVKPKYLNTAVTNSENNFDDLDDEEQVVPKVDIEEVFEEDDVVTSFRQEKEDEINKNNPEELSLTLPGWGGWAGKGVKAPKRKKNRFITKKPPKTLRRDENKGDVIINESKNPKLAIHKVSDLPHPFNSVKEYEESIRTPLGNTFVPETAHKKLIKPNVITRSGTIIEPMDEEELLVPRNRNFKNKSVIKILGKQ-
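
The two sequence records above are mutually consistent: 1785 bp is 595 codons, that is 594 residues plus a stop codon, which the protein record writes as a trailing "-". A minimal sketch of that check, assuming the standard genetic code and that the transcript record is a complete coding sequence with no untranslated regions; the function names `translate` and `expected_protein_length` are hypothetical and not part of the database:

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in TCAG order: TTT, TTC, TTA, TTG, TCT, ... GGG.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as `stop_symbol`, matching the trailing "-"
    used in the protein record above. Assumes an unambiguous A/C/G/T
    sequence whose length is a multiple of three.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


def expected_protein_length(transcript_bp):
    """Residue count for a CDS of `transcript_bp` bases ending in one stop codon."""
    return transcript_bp // 3 - 1


# First seven codons and last four codons of DPOGS200680-TA.
print(translate("ATGAGTTATGAAGAAATGGTT"))  # MSYEEMV
print(translate("GGCAAGCAATAA"))           # GKQ-
print(expected_protein_length(1785))       # 594
```

Passing the full DPOGS200680-TA sequence to `translate` and comparing the result with DPOGS200680-PA extends the same check to the whole record.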