Monarch geneset OGS2.0

DPOGS200723
Transcript         DPOGS200723-TA    1215 bp
Protein            DPOGS200723-PA    404 aa
Genomic position   DPSCF300030 - 169715-175138
RNAseq coverage    30x (Rank: top 75%)
Annotation
Heliconius        HMEL008965          4e-129    78.85%
Bombyx            BGIBMGA001122-TA    2e-124    71.48%
Drosophila        CG7768-PA           6e-23     37.82%
EBI UniRef50      UniRef50_Q2F611     2e-152    84.26%    Peptidyl-prolyl cis-trans isomerase E n=1 Tax=Bombyx mori RepID=Q2F611_BOMMO
NCBI RefSeq       NP_001040183.1      3e-153    84.26%    peptidyl-prolyl cis-trans isomerase E [Bombyx mori]
NCBI nr blastp    gi|114051790        7e-152    84.26%    peptidyl-prolyl cis-trans isomerase E [Bombyx mori]
NCBI nr blastx    gi|114051790        2e-151    84.26%    peptidyl-prolyl cis-trans isomerase E [Bombyx mori]
Group
Gene Ontology     GO:0006457    1.8e-32    protein folding
                  GO:0003755    1.8e-32    peptidyl-prolyl cis-trans isomerase activity
KEGG pathway 
InterPro domain   [213-380]    IPR015891    5.1e-35    Cyclophilin-like
                  [216-378]    IPR002130    1.8e-32    Peptidyl-prolyl cis-trans isomerase, cyclophilin-type
Orthology group   MCL26474    Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200723-TA
ATGCATCCATTGAGAGAGAGAGCGTGTGACCATTTGCTAGAACATCCGTTACTACGCACTCCACCACCCGAACCCGAGCGACCGCGGCCGCCTCGCGTACGGAAACCAAGAGAATTTAAACTTCCGCCAAGGTTTCCATTGCTCTCTAAAGATTTTTATGCAACATGGTGTAATCATCAAAAAGAGCTCCAAATGTCCAAGCACAAAGTGGACGATCATCCGCCGGAACTTTTTCCCGAATTATATTTGAAGCCTCGAAGGCTCAACTACTACGCTCGTCAGCTGGAAGAAAATATGAACGTTAACAAAGAACTGCTGAAACGAATCAATCTCATACAAAGGACTGGGGGTTTTGTGGATTGCTGGATGAACCCCGAGCCTACCGACACATACAATAAGCTGTTGTGCAAACAAAGACAACGTGTCTTGGATGAAATACGAAAACAAAACTTATATTTTTATTCGAGACTTCTTATAGCGAGATCGGAACAACCTCTCACAAGAGAACTGGAAAATGCTTGGCAGGACACAAAGCATAAACTCATTTTGGGTGCATCGCTTCCATTTATACTTTTTAAAACTGAGAAAATTGATCGTGATATCAGAGATCCTGCTTTTGACAAACCACCAAATGTAACACGTACAAAAGTATCTTTGGAAATTTGGGTTGTTGGAGGCTCAAAGATCGGACGTGTGATTGCTGAATTATTTAACGACATTGCACCTAAAACTTGTCAGTTATTCCTTAGCCTTTTAAAAGGCGATTCTAGAGGACACGCGTACATGGGGACCAGGTTTTTCAGAGTTGTACCGAATCTATACTGTCGTGGAGGTGACGTCGTCAAAGACAATGGTTTTGGTTGTTTTTTGCCAGAAGGTGAAGTAGAATTGATGACATCTGAGAACTTTAAATTAAAACACACCGTACCAGGAGTAATGTCTATGGCAGTCACCACAGACAATGAAGTTTGCGGACAGTTTAATATAATTTTCAAACCTCTGCCTCAATTTGACGGAAAAAATGTCGTCTTTGGAAGAATTATCGCGGGTCCAACTCAAGCCCTCGAGCGCATCAGTGCTTTAGGATTGCCACTCGGCACAACCACCTCCGACTGTGTTATACGCAACTGTGGCTGGTTCACACGTAGCGGACAATACAGAGATGGCAACCCCAATACTATCAAGTTTCCGCCGAGGAGAAAAGCTAAAAAGTAG

Protein sequence:

>DPOGS200723-PA
MHPLRERACDHLLEHPLLRTPPPEPERPRPPRVRKPREFKLPPRFPLLSKDFYATWCNHQKELQMSKHKVDDHPPELFPELYLKPRRLNYYARQLEENMNVNKELLKRINLIQRTGGFVDCWMNPEPTDTYNKLLCKQRQRVLDEIRKQNLYFYSRLLIARSEQPLTRELENAWQDTKHKLILGASLPFILFKTEKIDRDIRDPAFDKPPNVTRTKVSLEIWVVGGSKIGRVIAELFNDIAPKTCQLFLSLLKGDSRGHAYMGTRFFRVVPNLYCRGGDVVKDNGFGCFLPEGEVELMTSENFKLKHTVPGVMSMAVTTDNEVCGQFNIIFKPLPQFDGKNVVFGRIIAGPTQALERISALGLPLGTTTSDCVIRNCGWFTRSGQYRDGNPNTIKFPPRRKAKK-