Monarch geneset OGS2.0

DPOGS200143
Transcript        DPOGS200143-TA    1557 bp
Protein           DPOGS200143-PA    518 aa
Genomic position  DPSCF300128 - 329401-331983
RNAseq coverage   184x (Rank: top 49%)
Annotation
Heliconius      HMEL006879        4e-174   64.48%
Bombyx          BGIBMGA002786-TA  0.0      83.43%
Drosophila      CG7747-PA         0.0      64.50%
EBI UniRef50    UniRef50_Q13356   7e-168   58.21%   Peptidyl-prolyl cis-trans isomerase-like 2 n=91 Tax=Eukaryota RepID=PPIL2_HUMAN
NCBI RefSeq     XP_001662138.1    0.0      66.86%   cyclophilin [Aedes aegypti]
NCBI nr blastp  gi|157131145      0.0      66.86%   cyclophilin [Aedes aegypti]
NCBI nr blastx  gi|383863623      0.0      68.52%   PREDICTED: peptidyl-prolyl cis-trans isomerase-like 2-like [Megachile rotundata]
Group
Gene Ontology    GO:0006457  5.4e-70  protein folding
                 GO:0003755  5.4e-70  peptidyl-prolyl cis-trans isomerase activity
KEGG pathway     aag:AaeL_AAEL012008  0.0
                 K10598 (PPIL2, CYC4, CHP60) maps-> Ubiquitin mediated proteolysis
InterPro domain  [265-440]  IPR002130  5.4e-70  Peptidyl-prolyl cis-trans isomerase, cyclophilin-type
                 [278-454]  IPR015891  3.2e-66  Cyclophilin-like
                 [44-99]    IPR013083  1.2e-06  Zinc finger, RING/FYVE/PHD-type
Orthology group  MCL12668  Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200143-TA
ATGGGGAAACGACAACATCAGAAAGATAAAATGTACCTCACTTACACTGAATGGACAACTTTATATGGTGGTAAAAGATCAGGGACAGCAGTTGAGGAGGACTCATCGTTTAAGCGCCTCCCATTTGATCACTGCTGCCTTTGTTTACATCCATTTGATAACCCTTATTGCGACAGTGACGGAAATATATTTGAATTGCAAGCTTTGACGGACTTCATCAAAAAGTTCAAAATCAATCCAGTCACAGGAAAGAAAATAGATGTTGCAAGTTTAATAAAGCTCAATTTCACAAAAAATGCTGAAGATTCCTACCATTGCCCAGTTTTATTCAAACCGTTCACTAAGAATTCACATATTGTGGCCATACGGACCACGGGTAATGTGTACTGCTATGAAGCGGTTGAACAGCTTAACATTAAGGGCAAGAACTGGAAGGATCTCCTCAACGACACTCCATTCGCAAGAACCGATATCATAACAATACAGGATCCGAACAATTTGAAAAAATTCAACATATCCACATTCCATCATATAAAAAACAATTTGAGAGTGGAAACCGAAGAGGAGATTGCACTCCGCAAAGATCCACATGCAAGATTAAAAACTGTATCAGCCGAAACAAAAGACATTTTACAAGAACTGGAAAAAGAGTACAAAGCTCCGGAAACTAAAGAAGTTAAAAAGGAGGTAGCTGACAAATTCAATGCAGCACACTACTCAACAGGAATGGTGGCGGCGAGCTTCACATCTACAGCGATGGCACCGGAAACTGTTCATGAAGCAGCAGTCATATGTGAGGATGAAGTAAAATATGATAGGGTTAAGAAAAAAGGCTATGTACGTTTAGTGACAAACTTAGGTCAACTAAACTTTGAACTGTACTGTGATTTGACACCTAAGGCATGTGACAATTTCATAAAACATTGTTTAAGTGGTTACTATAATGGCACAAAGTTTCACAGGTCAATAAGAAATTTTATGATCCAAGGTGGTGATCCAACGGGAACAGGTTTAGGCGGTGAGTCAATATGGAAGAAGCCATTTGAAGATGAAGTGAAACCAAACTTGCACCACACTGGCCGAGGCATACTTTCCATGGCTAATTCTGGACCTAACACAAATGGGTCGCAATTTTTCATAACTTTCCGTTCCTGTAAACAGTTAGATGGTAAACACACAATATTTGGTAAACTGGTTGGTGGTATCGATACACTTACAGCCATGGAACAGATCGAGGTCGACAACCGTGACAGACCTATAGAGGATATTGTTATAGAAGTTGCCCAGGTGTTTGTTGACCCATTTGCTGAGGCCGAAGAACAGCTTGCTGCGGAAAGAGCTGCTGAAACTAAAAAGCAAGCAGAGGCCGAAGGATCTGGCGTTAAACCCAAGAAGGCATCAGCAAAACCTCTTAAAGTGTTCAGGAGTGGAGTCGGGAAATATTTAAATTTACAAGAATCATCCGCTACCAGTAAAACTACAAAAAGCCAAGAGGTCCCCGCCAAGAAACCCAAGAAAGATGCCAATTATAACTTCGGTGCATTCGACTCTTGGTAG

Protein sequence:

>DPOGS200143-PA
MGKRQHQKDKMYLTYTEWTTLYGGKRSGTAVEEDSSFKRLPFDHCCLCLHPFDNPYCDSDGNIFELQALTDFIKKFKINPVTGKKIDVASLIKLNFTKNAEDSYHCPVLFKPFTKNSHIVAIRTTGNVYCYEAVEQLNIKGKNWKDLLNDTPFARTDIITIQDPNNLKKFNISTFHHIKNNLRVETEEEIALRKDPHARLKTVSAETKDILQELEKEYKAPETKEVKKEVADKFNAAHYSTGMVAASFTSTAMAPETVHEAAVICEDEVKYDRVKKKGYVRLVTNLGQLNFELYCDLTPKACDNFIKHCLSGYYNGTKFHRSIRNFMIQGGDPTGTGLGGESIWKKPFEDEVKPNLHHTGRGILSMANSGPNTNGSQFFITFRSCKQLDGKHTIFGKLVGGIDTLTAMEQIEVDNRDRPIEDIVIEVAQVFVDPFAEAEEQLAAERAAETKKQAEAEGSGVKPKKASAKPLKVFRSGVGKYLNLQESSATSKTTKSQEVPAKKPKKDANYNFGAFDSW-
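
The transcript and protein lengths in this record agree with each other: 518 codons plus one stop codon give 3 x 519 = 1557 bp. The following is a minimal Python sketch of how that relationship, and the translation itself, could be checked. It assumes the standard genetic code (NCBI translation table 1); the names `CODON_TABLE`, `translate`, and `expected_cds_length` are illustrative and are not part of the database or any library. The sketch writes the stop codon as `*`, whereas this record writes it as a trailing `-`.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order for each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon; stop codons become '*'."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def expected_cds_length(protein_length: int) -> int:
    """Nucleotides needed to encode the protein plus one stop codon."""
    return 3 * (protein_length + 1)


# First 30 nt and last 12 nt of DPOGS200143-TA, copied from the record.
print(translate("ATGGGGAAACGACAACATCAGAAAGATAAA"))  # MGKRQHQKDK
print(translate("GACTCTTGGTAG"))                    # DSW*
print(expected_cds_length(518))                     # 1557
```

Running `translate` on the full DPOGS200143-TA sequence and comparing the result with DPOGS200143-PA (after replacing the trailing `-` with `*`) would extend the same check to the whole record.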