Monarch geneset OGS2.0

DPOGS202136
Transcript         DPOGS202136-TA    1572 bp
Protein            DPOGS202136-PA    523 aa
Genomic position   DPSCF300193 - 58321-60156
RNAseq coverage    283x (Rank: top 39%)
Annotation
Heliconius         HMEL014625         0.0       81.65%
Bombyx             BGIBMGA001509-TA   0.0       74.62%
Drosophila         CG3071-PA          3e-124    42.75%
EBI UniRef50       UniRef50_F4WRW1    3e-153    49.33%   U3 small nucleolar RNA-associated protein 15-like protein n=6 Tax=Neoptera RepID=F4WRW1_ACREC
NCBI RefSeq        XP_973650.1        3e-163    53.19%   PREDICTED: similar to U3 small nucleolar RNA-associated protein, putative [Tribolium castaneum]
NCBI nr blastp     gi|91076528        5e-162    53.19%   PREDICTED: similar to U3 small nucleolar RNA-associated protein, putative [Tribolium castaneum]
NCBI nr blastx     gi|91076528        3e-161    53.19%   PREDICTED: similar to U3 small nucleolar RNA-associated protein, putative [Tribolium castaneum]
Group
Gene Ontology      GO:0005515    5.7e-57    protein binding
                   GO:0005730    6.5e-29    nucleolus
                   GO:0006364    6.5e-29    rRNA processing
KEGG pathway 
InterPro domain    [42-325]     IPR011046    5.7e-57    WD40 repeat-like-containing domain
                   [44-324]     IPR015943    1.8e-55    WD40/YVTN repeat-like-containing domain
                   [344-490]    IPR018983    6.5e-29    U3 small nucleolar RNA-associated protein 15, C-terminal
Orthology group    MCL14765    Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202136-TA
ATGGCACATATATCTTATCCGTTTAAAAAAACAAACAAAGCTGTATATCAAAAACCTGCCTCTGTTTTAACTGAGGATACCCTTTATTGGAAGAAACTTGGGCTTCCAGTGCTAGTTAAAGAATTTGGAGCTATAGACTATCTGGATTTCAGCCCAGTAGAGCCCTATTATTTTGCTGCGACATGCTCTGTCAGAGTACAGGTTTATGACCCTATTTCAAAAGTGGTGGCAAAGAATATATCAAAATTTGTTGAGGCCGCATATGGAGGTTGCTTCAGAAACGATGGAAGGTTACTTGTTGCTGGTAGTGAAGAATCTGTTGTAAAATTGTTTGATGTCCAATCAAAAAATGTGTTGAGAGTGTTTACAGGTCACACTGGTCCGGTACATAGAACATTCTTTACTAAAGATCAAGTAAAAGTCCTTAGTTTCTCTGATGACAAATCAGTTTGTTTATGGGATATTGCAACTGAAGAAAAAATTGCAAGCTTCTGTGAACACACGGACTATGTGAGGGCTGGTGCTCCAAGTCCAATTTTACCGGACATCATATTATCTGGTGGTTATGACCATGCTGTTAAATTATATGACTGTCGATCAAATGAAACGGTTCTCACAGTTGACCATGGCAGTCCAGTTGAATCTACATTATTTTTACCATCTGGAGGAATATTTATTAGTGCTGGCGGCACTGAGATCAAAGTATGGGATATATTTAACGGTGGAAAATTACTTGCAAATATTTCACAGCACCACAAAACTGTTACAACTTTAAGGCTAGCTAGCAATAATAGCAGACTTATGTCAGCATCGCTTGATAGGCATGTAAAGATTTATGATCTAGCCACATTCAAAGTGGTCCACAATATTGACTTCCCGAATGCTGTGTTAAGTATGGCAATATCAGAGTGTGATGATGTGCTAGCAGTTGGAATGATTGATGGGGTTATTTCTATAAGAAAGAGAGAACAACCGGCATCATTGTTTGAAGAAAAGAGGGGACTATTCAAATTTGCCCCCGACCACATAACAGCAGAAACTGTAGATGAGGTTGTTTCAAAACAAAAGATCGAAAAGGGACCTGATTACGACAAATTTCTAAGAAAAATGGAATTCAGCAAAGCACTATCAGCTGTATTGAAAACTTATGTAGCAACTAAATTTCCCGAAAAAACCATCGCTCTGATGCAAGAAATGCTGAGAAGGAAAGTCCTACATTCAGCCATAAAAGGAATAAAAGAAAACGAAGTAGGAGCACTTCTGAAATTCTTTAAAAAAAATTTAGGTGAAACAAGATTCACAAGGACTATTATCGATGCTACTAATGTTTTCATTGATGTTTTTGAAAATGAAATCAAACTATTTTCTGAACAAAATTTATGCCTTTACAAGTCATTGCTAGAAGAAATTAAAGAGGAAATAGAAGTCTGTAAAAGAGTTAGTGAGCTTGAAGGTGCCATTGGACTTATTCTATCTGGTGCTCAAGTTGGTACAAGCCAAGATATTATAGAACTAAATGACAATATGGCTCCGTCAGCTAAAGCTCGGAAAGATATTGTAATAGATGTTTGA

Protein sequence:

>DPOGS202136-PA
MAHISYPFKKTNKAVYQKPASVLTEDTLYWKKLGLPVLVKEFGAIDYLDFSPVEPYYFAATCSVRVQVYDPISKVVAKNISKFVEAAYGGCFRNDGRLLVAGSEESVVKLFDVQSKNVLRVFTGHTGPVHRTFFTKDQVKVLSFSDDKSVCLWDIATEEKIASFCEHTDYVRAGAPSPILPDIILSGGYDHAVKLYDCRSNETVLTVDHGSPVESTLFLPSGGIFISAGGTEIKVWDIFNGGKLLANISQHHKTVTTLRLASNNSRLMSASLDRHVKIYDLATFKVVHNIDFPNAVLSMAISECDDVLAVGMIDGVISIRKREQPASLFEEKRGLFKFAPDHITAETVDEVVSKQKIEKGPDYDKFLRKMEFSKALSAVLKTYVATKFPEKTIALMQEMLRRKVLHSAIKGIKENEVGALLKFFKKNLGETRFTRTIIDATNVFIDVFENEIKLFSEQNLCLYKSLLEEIKEEIEVCKRVSELEGAIGLILSGAQVGTSQDIIELNDNMAPSAKARKDIVIDV-
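
The transcript and protein entries above can be cross-checked against each other. The following is a minimal sketch, not part of the original record, and it assumes the standard genetic code (NCBI translation table 1) applies to this transcript. The names `translate` and `expected_protein_length` are hypothetical helpers introduced for illustration. Stop codons are written as "-" to match the trailing "-" in the protein sequence above.

```python
from itertools import product

# Standard genetic code (NCBI table 1), assumed here; codons are
# enumerated in T, C, A, G order so they line up with AMINO.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are rendered as `stop_symbol`. Raises ValueError if the
    length is not a multiple of three, and KeyError on any codon that
    contains characters other than A, C, G, T.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


def expected_protein_length(cds_length_bp):
    """Residue count for a CDS that ends in exactly one stop codon."""
    return cds_length_bp // 3 - 1


# First six codons and last four codons of DPOGS202136-TA.
print(translate("ATGGCACATATATCTTAT"))   # MAHISY
print(translate("ATAGATGTTTGA"))         # IDV-
print(expected_protein_length(1572))     # 523
```

The first and last codons of DPOGS202136-TA translate to the start (MAHISY) and end (IDV-) of DPOGS202136-PA, and a 1572 bp transcript with a single terminal stop codon corresponds to the listed 523 aa. Passing the full nucleotide sequence to `translate` would allow a complete comparison.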