Monarch geneset OGS2.0

DPOGS204885
TranscriptDPOGS204885-TA1317 bp
ProteinDPOGS204885-PA438 aa
Genomic positionDPSCF300307 + 25878-28667
RNAseq coverage5229x (Rank: top 2%)
Annotation
HeliconiusHMEL0063080.080.82% 
BombyxBGIBMGA011710-TA0.076.48% 
DrosophilaCG4169-PA2e-6335.19% 
EBI UniRef50UniRef50_Q2F6400.076.48%Ubiquinol-cytochrome c reductase core protein II n=1 Tax=Bombyx mori RepID=Q2F640_BOMMO
NCBI RefSeqNP_001106225.10.076.48%ubiquinol-cytochrome c reductase core protein II [Bombyx mori]
NCBI nr blastpgi|1638386840.076.48%ubiquinol-cytochrome c reductase core protein II [Bombyx mori]
NCBI nr blastxgi|1638386840.076.48%ubiquinol-cytochrome c reductase core protein II [Bombyx mori]
Group
Gene OntologyGO:00468723.5e-35metal ion binding
GO:00038243.5e-35catalytic activity
GO:00065082.5e-22proteolysis
GO:00042222.5e-22metalloendopeptidase activity
GO:00082701.6e-13zinc ion binding
KEGG pathwaytca:6568179e-97 
 K00415 (QCR2, UQCRC2)maps-> Huntington's disease
    Oxidative phosphorylation
    Alzheimer's disease
    Cardiac muscle contraction
    Parkinson's disease
InterPro domain[238-438] IPR0112373.5e-35Peptidase M16, core
[29-231] IPR0112498.5e-35Metalloenzyme, LuxS/M16 peptidase-like, metal-binding
[43-183] IPR0117652.5e-22Peptidase M16, N-terminal
[191-346] IPR0078631.6e-13Peptidase M16, C-terminal
Orthology groupMCL11876 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204885-TA
ATGTCGTCTAAAAACTTAGCTCTCCCTTTTGTACGCCACATAACAATCAGAGGTTATGCCCAGGCTGCGCCCGCGGTCAGAAAGGATGCGAAACTCCAGACCAGTGTTTTGCCTAATAAAACTTTCGTGGCCGCTGTGGATAATGGTTCTCCGGTGACTCGAGTCACTATTGCCTTCAAAGCTGGCTCCCGCTATGAACCTCAAACTGAGCTTGGTTTAGCACATGTCCTCCGTTCAGCTGCAGGTCTCACCACCAAGAATGCAAGTGCTTTCATGATATCACGCAAGCTCTCACAGATTGGTGCATCCTTTAATGCATCTGGTGATCGTGAATTTATTTACTACACCCTTGAAGCTTCACAAGAAAACTTGCAAGAAGCATTGGTGTACTTGAATAATATTGTGTCCAACCAAGAGTTTAGACCGTGGGAATTGAGTGACAATGTCTCCCGCTTGAAGTTTGACATACTATCTCTTTCACCTCAAGTCCGTGCTGTAGATTTGCTGCATAAAGCTGCTTTCCGCCGTGGACTTGGAAACTCTCTCTTCATTGCCCCAAAGAAGATTGGAAAAATCAGTTCCGAATCTTTGCAACATTTTGCTTCTACAACCCTTACTCCTGGCCGCTGTGCTGTATCTATTGTTGGGGGTTCTCAGGACACTGCTGCACTAGTCGCTCAAGCTCTTCAACTTCCAGCGGGTGGTGATGCTCAGAGCAATGCTTCTACCTACTTTGGTGGTGAATTAAGAAAAGAAATGGGAGGAGATCTAGCTCATGTGGCATTAGCTGTACCTGGAGCTCCAGCAGGCTCACCACAGAGTCTGGCTCTAGCTGTGGCTGCTAAAGCTCTTGGCAATGGACCAGTCACGAAATGGGGGGCCGACAATACTCCTCTGGCTAAGGCTATTGGTAACATTGGTCCATTTGCTGCTGCAGGATTCAATGTATCATATTCCGACAGTGGTCTGTTTGGAATTGTGCTATCTGTGCCCAAGGATGAAGCAAATGCTGCCCTTAAAGCTGCTTCCAAACTGCTCAAGAACCCAAACCTGAGTGGTGATGCTATCAAAGCTGGCAAGAACCAACTTAAATTGCAAGTGCTGTCTGAGGCAGAAAGTGGAGTTTCACTCTCTGAATCTCTAGCCGTCCAAGGATTATATACAGGCTCTGCTAAATCAGCTGTAGACATTGCTAAAGACATTGACCAACTGTCTTCTAATGATGTGTCACAGGTACTTGCCAGCGTCATCAAAAATAAGGTGTCTATGGGCGCAGCAGGAAACCTCGCATTTGTACCTTACGTTGATGAACTATAA

Protein sequence:

>DPOGS204885-PA
MSSKNLALPFVRHITIRGYAQAAPAVRKDAKLQTSVLPNKTFVAAVDNGSPVTRVTIAFKAGSRYEPQTELGLAHVLRSAAGLTTKNASAFMISRKLSQIGASFNASGDREFIYYTLEASQENLQEALVYLNNIVSNQEFRPWELSDNVSRLKFDILSLSPQVRAVDLLHKAAFRRGLGNSLFIAPKKIGKISSESLQHFASTTLTPGRCAVSIVGGSQDTAALVAQALQLPAGGDAQSNASTYFGGELRKEMGGDLAHVALAVPGAPAGSPQSLALAVAAKALGNGPVTKWGADNTPLAKAIGNIGPFAAAGFNVSYSDSGLFGIVLSVPKDEANAALKAASKLLKNPNLSGDAIKAGKNQLKLQVLSEAESGVSLSESLAVQGLYTGSAKSAVDIAKDIDQLSSNDVSQVLASVIKNKVSMGAAGNLAFVPYVDEL-