Monarch geneset OGS2.0

DPOGS207243
TranscriptDPOGS207243-TA1704 bp
ProteinDPOGS207243-PA567 aa
Genomic positionDPSCF300008 - 1461290-1464643
RNAseq coverage175x (Rank: top 50%)
Annotation
HeliconiusHMEL0062860.070.07% 
BombyxBGIBMGA012059-TA2e-11163.55% 
DrosophilaCG7246-PA4e-4224.59% 
EBI UniRef50UniRef50_UPI00015B5C787e-7230.45%UPI00015B5C78 related cluster n=1 Tax=unknown RepID=UPI00015B5C78
NCBI RefSeqXP_395867.33e-7433.61%PREDICTED: similar to RIKEN cDNA 4732497O03 [Apis mellifera]
NCBI nr blastpgi|3287888094e-7933.39%PREDICTED: u3 small nucleolar RNA-associated protein 6 homolog [Apis mellifera]
NCBI nr blastxgi|3800277292e-8333.11%PREDICTED: U3 small nucleolar RNA-associated protein 6 homolog [Apis florea]
Group
KEGG pathway 
InterPro domain[9-91] IPR0139492.5e-19U3 small nucleolar RNA-associated protein 6
Orthology groupMCL12242 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207243-TA
ATGGCTGAACAAGTAAATCAACGTATAGAAGGTATGATAAATGAGTTGGAGCAAATGCGTAGAACTAATCTTTACGAGGACGATGAAATAAGAGAAATTTCCCGTAAACGAAAGGAGTTTGAATATAAAATACAACGAAGAATAAAAGAGAAAAGCGACTTTGTTCAATACATTGCATTTGAATTGGCTCTTCTAGAAGATATTTCTCTTAGAAGGAAACAAGCAAAGTTGGGGGAAAAGAAGAAAGATATTGAATATGCCATTGCTAAGAGACTAAATAAAGTTTTCAAACAATTTATATTTCGTTTTCAAAATGACATAGCTATTTACTTTGAGTACATAAAATTCTGTCAAGCTGTTGGATTTGATTATGCGGTATCTGCTATTATTGACCAAATGTTGAGAGTACATGGTGATAAGCCCAAAACATGGCAGTTGGCAAGCAAATGGGAAAGCAAGGAACAGAACAATCTAGAAAATGCTAGAAATTTTTTACTTAAAGGCATTCACAGACATCCCAATTCTGATATATTATACTTAGATCTTTTTGATATCGAACTTATGATTGCTTTTAAAACTGAAGATGAAACAGAAAAAGCAAAAAATTTCAAAAGGGCCGATGTTGTATGGAGAAATGGAATGAAGAACATTCCGGATGTGAATTATTTATTTAAATTATGTGATATATCATTGAGGTATGGTGTTAACGAGGACATATCCAATTCTATAAAGCAAGAAATATGGAACAGAAGATCCGAAAAGCGAGTTTGGTCATACATTGCTTCTAAAGAATTGGAGGGATATCACTGGAAAGATATTGAGGAGTATGTGAGTGAAGAGTTCAGTTATTCAAAAGAATTGAACTATTATATAGCTGTGTATGAGGAAGCTTTAATGCAGTTTCCAGATGAAAATCTATCAACTATGTACATTCATGGGTTACTCGGTTTAAAGGATAATTTGTGTACAGATTTACAAAAAATTTGTGCCGTCAAACAAGCATGGTTCTTCAGTCACGAGAACGGGTTGCTGAGTAATGATATGTATGTTTTTGGTATAAAAATGTTAAAGTTAGAAGGCGAGATTACTGAAACTCAATTAACTGAGGTTCTGGATACAGCATTGGCAAAAAATCCGCTTTATAGATATTTATGGGAAGAGAAAATTTTATTACACAAAAATGATGAAGATGTAATTCTAAAAACACTAAAAGATGCCACAAAAATTTTAAAAACTGATGATGTTAGATGTCTTTGGAATTTTGTTTTTGATAATATTGAGTCACATATGGTTTTTAAGAATTGTTATTCTAAGCTACAGTCATGTGAAAGTGTTGTATTTATGACTCTCAAACCCACACTCTTAAAGAAGATGTATGAACACAATGGATTGAAGGCGGCACGGCAAGTTTACGAAGAGTGTATAAGAACCCCTCCCACACAAGAAGAAGTACATAGCATAATGATTGACATTGAAATGAATCAAGAAAAACCATCACTCAAGAATGTAAGAAAATGCTATGAGGCCCTAGTTCAACATCATGGCAAGAGTAATATTAAAGTTTGGATGGACTACATAAACTTTGAACAAAAGTATGGAAATGCCCAAGCTGTGCCTTCCATTCATAGGAGGGCTATAGGAATGTTGGATAAAAATTCTGTCGATGATTTTATCAAAGCTCAAACGCTAGCAAAATTAAATTAA

Protein sequence:

>DPOGS207243-PA
MAEQVNQRIEGMINELEQMRRTNLYEDDEIREISRKRKEFEYKIQRRIKEKSDFVQYIAFELALLEDISLRRKQAKLGEKKKDIEYAIAKRLNKVFKQFIFRFQNDIAIYFEYIKFCQAVGFDYAVSAIIDQMLRVHGDKPKTWQLASKWESKEQNNLENARNFLLKGIHRHPNSDILYLDLFDIELMIAFKTEDETEKAKNFKRADVVWRNGMKNIPDVNYLFKLCDISLRYGVNEDISNSIKQEIWNRRSEKRVWSYIASKELEGYHWKDIEEYVSEEFSYSKELNYYIAVYEEALMQFPDENLSTMYIHGLLGLKDNLCTDLQKICAVKQAWFFSHENGLLSNDMYVFGIKMLKLEGEITETQLTEVLDTALAKNPLYRYLWEEKILLHKNDEDVILKTLKDATKILKTDDVRCLWNFVFDNIESHMVFKNCYSKLQSCESVVFMTLKPTLLKKMYEHNGLKAARQVYEECIRTPPTQEEVHSIMIDIEMNQEKPSLKNVRKCYEALVQHHGKSNIKVWMDYINFEQKYGNAQAVPSIHRRAIGMLDKNSVDDFIKAQTLAKLN-