Monarch geneset OGS2.0

DPOGS200616
TranscriptDPOGS200616-TA2001 bp
ProteinDPOGS200616-PA666 aa
Genomic positionDPSCF300076 - 1420-3420
RNAseq coverage0x (Rank: top 95%)
Annotation
HeliconiusHMEL0147370.066.37% 
BombyxBGIBMGA008977-TA0.071.96% 
DrosophilaCG5808-PA3e-16662.96% 
EBI UniRef50UniRef50_Q9XYZ64e-16462.96%CG5808 n=43 Tax=Eukaryota RepID=Q9XYZ6_DROME
NCBI RefSeqNP_651291.17e-16562.96%CG5808 [Drosophila melanogaster]
NCBI nr blastpgi|213556771e-16362.96%CG5808 [Drosophila melanogaster]
NCBI nr blastxgi|1571109422e-17851.82%cyclophilin-6 [Aedes aegypti]
Group
Gene OntologyGO:00064572.4e-47protein folding
GO:00037552.4e-47peptidyl-prolyl cis-trans isomerase activity
GO:00001661.6e-22nucleotide binding
GO:00036761.5e-20nucleic acid binding
KEGG pathway 
InterPro domain[1-185] IPR0158911.1e-50Cyclophilin-like
[1-170] IPR0021302.4e-47Peptidyl-prolyl cis-trans isomerase, cyclophilin-type
[231-319] IPR0126771.6e-22Nucleotide-binding, alpha-beta plait
[241-314] IPR0005041.5e-20RNA recognition motif domain
Orthology groupMCL12135 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200616-TA
ATGTCAGTGGTAATTGAAACCACACTGGGAGATATTACTGTAGATCTTTATTTAGATCAACGACCAGTAACATGTTTAAACTTCTTGAAGTTGTGCAAGATGAAATACTATAACTACAATTTGTTTCATACAATACGAAACGGTTTCATTGCTCAAACTGGTGATCCCAGTGGTGAAGGATCGGGAGGACAATCAATATGGGGTATTCTAGAAGGACCGCACAAACGCTTTTTCTCTGGCGAGAAAATGCCCAAGATTCGTCACACCGACGCTGGTTTGCTATCCATGGTATGTACCGATGATATGATGGTAGGTTCGCAATTCTTTTTCACACTTGCGCCCGATTTAAACTCTTTAGATGGAACACATTGTGTAATAGGTGAAGTAACTGAAGGGCATGATGTACTAACTAAGCTTAATGAAGTAATATGTGATGAATCCTATCGTCCATACAGAGATGTAAGAATTACACATACAGTTGTACTTGAAGATCCTTTCAACGACCCTCCCGGGTTGAGAGCTCCATCAAGGTCGCCATCTCCAAGTGCAGAACGCTTAAAGGGTGGTAGAATTGCTCCAGATGAAGAAATTGATGAAGCACAGGGCAAAACTGCCGAGGAAATCCAAGAAATGATTGAAGAAAAGGAAGCAAAGGCTAGAGCAACAATTTTAGAAATTGTTGGTGATTTACCTGATGCTGAAATAGCACCTCCTGAAAATGTTTTATTTGTATGTAAACTCAATCCAGTCACAACAGATGAAGACTTGGAAATAATATTTAGTCGTTTTGGTAAAATAGTAAGCTGTGAAGTTATTAGAGACAAAAAGACTAGTAACTCTTTACAATATGCTTTTATTGAATTTGACAATAAAAAATCTTGTGAAGATGCTTACTTTAAAATGGACAATGTGTTGATTGATGATCGCCGAATTCATGTTGACTTTTCACAGTCTGTATCTAAAATGAGATGGTTAGGGAAAGGTCGAGGTGTTCAGTATTTTGATGACGATTCAAACAAAAAATCTCAAAGAAACCCAAATAGAAATAGTCACAGACAACAACAAACTGATGATACAAATCATAGGAGGCGTTATGAAAATGATAGAAATGGTAAAGAAAGTCATAGACATGATAAAGAAAGAGACAGTTATAGGGACCACAAAGATAAGCCAAAAGATAAAAATGTTAGGAGTAGTGATAGAGAAAGTAAAATATATAACCGAAAGCGTAGTAGAAGTCGATCTAGAAGAGATAATTCAAAAGATAGAATACACGAAAGAAGACAAGATAAGACGAGATATCCTAAAGATATGAGTAAGTCTAGAGAGGACAGAGAAAAAAAGAAAGAATCACATGAATATGTTGAGAATAAAAGGGATAATCGTAATTATTATAATTCAAGATCATCACCGAGTAAAAAATTTAGAAAAGAATCACCATCAAAACGGCATAGAGAGAGAGAATTAAGCAGGAGTAGAGATAGGAGTGACAGAAACAGACAAGAAAATAATGGTGTTCATCCAAGGTATAGAGAATCTAGTAATTATACACACAATAAAGAAAGTAAAAGTGAAAAAATTAAAGAAAATACTAAAGGTATAGAATCAAAGACACCATCACCAGTTACGAAGGAAAATAAAACACAATCCTCAAAATCACAAAAGAAAACCAAAAAGAGTTGCAAAGAGAAGACAGAAGACTCAAAATCCAAAAAGAAGGAACGTAAAACTAAAAAACGTAAGCGCTCAACATCATCTTCAGAATCATCGGATTCAAGTTCTAGCAGTGATAGCAGTTCGAGTGAAAGTGATAGAAGGAAAAAGAAAATCAGAAGAAAAAAGAGAAAATATTCAACATCTAGCAGTAGTTCTGAGAGCACCTCAAGCAGCTCATCAAGCAGTTCGGACACCAGTACAGCGTCAAGTTCCAGTACCGATAGTAGCTGCAAAAAGAAACGTCAAAAGAAAAAACTTGTGAAGAAAAGCAAAAAGAAGTTATAG

Protein sequence:

>DPOGS200616-PA
MSVVIETTLGDITVDLYLDQRPVTCLNFLKLCKMKYYNYNLFHTIRNGFIAQTGDPSGEGSGGQSIWGILEGPHKRFFSGEKMPKIRHTDAGLLSMVCTDDMMVGSQFFFTLAPDLNSLDGTHCVIGEVTEGHDVLTKLNEVICDESYRPYRDVRITHTVVLEDPFNDPPGLRAPSRSPSPSAERLKGGRIAPDEEIDEAQGKTAEEIQEMIEEKEAKARATILEIVGDLPDAEIAPPENVLFVCKLNPVTTDEDLEIIFSRFGKIVSCEVIRDKKTSNSLQYAFIEFDNKKSCEDAYFKMDNVLIDDRRIHVDFSQSVSKMRWLGKGRGVQYFDDDSNKKSQRNPNRNSHRQQQTDDTNHRRRYENDRNGKESHRHDKERDSYRDHKDKPKDKNVRSSDRESKIYNRKRSRSRSRRDNSKDRIHERRQDKTRYPKDMSKSREDREKKKESHEYVENKRDNRNYYNSRSSPSKKFRKESPSKRHRERELSRSRDRSDRNRQENNGVHPRYRESSNYTHNKESKSEKIKENTKGIESKTPSPVTKENKTQSSKSQKKTKKSCKEKTEDSKSKKKERKTKKRKRSTSSSESSDSSSSSDSSSSESDRRKKKIRRKKRKYSTSSSSSESTSSSSSSSSDTSTASSSSTDSSCKKKRQKKKLVKKSKKKL-