Monarch geneset OGS2.0

DPOGS200523
TranscriptDPOGS200523-TA909 bp
ProteinDPOGS200523-PA302 aa
Genomic positionDPSCF300119 - 453309-457752
RNAseq coverage780x (Rank: top 16%)
Annotation
HeliconiusHMEL0054902e-12565.64% 
BombyxBGIBMGA009341-TA1e-8653.42% 
DrosophilaCG8336-PB8e-5740.79% 
EBI UniRef50UniRef50_Q087524e-5841.78%Peptidyl-prolyl cis-trans isomerase D n=54 Tax=Euteleostomi RepID=PPID_HUMAN
NCBI RefSeqXP_002007987.15e-6244.00%GI13253 [Drosophila mojavensis]
NCBI nr blastpgi|453606231e-6446.58%peptidyl-prolyl cis-trans isomerase D [Xenopus (Silurana) tropicalis]
NCBI nr blastxgi|3269182697e-6343.84%PREDICTED: peptidyl-prolyl cis-trans isomerase D-like [Meleagris gallopavo]
Group
Gene OntologyGO:00064578.1e-37protein folding
GO:00037558.1e-37peptidyl-prolyl cis-trans isomerase activity
GO:00054883.6e-11binding
KEGG pathwayxtr:3945812e-65 
 K05864 (CYPD, PPID)maps-> Huntington's disease
    Calcium signaling pathway
    Parkinson's disease
InterPro domain[1-102] IPR0158917.5e-37Cyclophilin-like
[1-101] IPR0021308.1e-37Peptidyl-prolyl cis-trans isomerase, cyclophilin-type
[133-206] IPR0231141.4e-14Elongated TPR repeat-containing domain
[207-268] IPR0119903.6e-11Tetratricopeptide-like helical
Orthology groupMCL13876 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200523-TA
ATGATCCAGGGTGGTGATATCATACATGGTAATGGTACCGGCGGTGAGAGCATCTACGGCCTGACCTTCGAAGATGAAAACTTCAAGCTTATTCATGAAGCAGGTGTCCTCAGTATGGCGAATGCTGGGCCAAATACGAACGGATCACAATTTTGCATCACCAGTGTGCCTTGTCCACAACTAGATGGCACTAACGTAGTTTTTGGACGGGTGCTGGCTGGCCTGGGGATAGTTCAGGAGATACAGAGCCTGTCCAGTGACGATACGCCCTCGGTCGAGTGTGTGATCGATGACTGCGGTGAGATAGCGGACCTGGATACATGGGATGTTTGCTGTCAGGATGGAACTTTGGACAGATTGCCGGAACATCCCGAGGATATGAGAACCAACCTCACTATGGACGAGCTCGTTGAAAGTATTCGCCGGGTGAAGGAGAGTGGTAATGATCTGTTTGGTGCGGGACGGTATAAGGCCGCTGCGAGGAAGTACCGGAAATGTAACAGATACGTCACACAGGCACAGGAAGTGGCAGCCAAGGACGGGGATAAGTACCTGAGCGAGCTGTCCTCGTGCGGTCGTCACTGTTGTCTCAACCTGGCGGCGTGTCAGTGCCGTCTGAGGGACTACCGCGCCGCTCTGAGCAGCTGCGATCAGGTACTCGACGTGGACCCCAAGAACGAGAAGGCCCTCTATCGCCGCGGTCAAGCAAACTACGCTCTGAAGAACTACGAGGCGGCTCTGAGCGATCTCAAGCTAGCGGATAAAGTTTCCCCGCGGAACAAAGCCGTCCAGAAGCTACTGGAAGAGGTCCGCGCGTCCAACAAAAACTACAACGACATACAAAAGCAGCGGCTGTCAAAATTTTTTCGTGACCAAAAAGAAAAAAACACAACGTTCGGACATCACTGA

Protein sequence:

>DPOGS200523-PA
MIQGGDIIHGNGTGGESIYGLTFEDENFKLIHEAGVLSMANAGPNTNGSQFCITSVPCPQLDGTNVVFGRVLAGLGIVQEIQSLSSDDTPSVECVIDDCGEIADLDTWDVCCQDGTLDRLPEHPEDMRTNLTMDELVESIRRVKESGNDLFGAGRYKAAARKYRKCNRYVTQAQEVAAKDGDKYLSELSSCGRHCCLNLAACQCRLRDYRAALSSCDQVLDVDPKNEKALYRRGQANYALKNYEAALSDLKLADKVSPRNKAVQKLLEEVRASNKNYNDIQKQRLSKFFRDQKEKNTTFGHH-