Monarch geneset OGS2.0

DPOGS208754
TranscriptDPOGS208754-TA912 bp
ProteinDPOGS208754-PA303 aa
Genomic positionDPSCF300043 + 617161-619717
RNAseq coverage3x (Rank: top 90%)
Annotation
HeliconiusHMEL0058149e-7174.38% 
BombyxBGIBMGA003414-TA5e-9984.16% 
Drosophilacyp33-PA2e-12769.13% 
EBI UniRef50UniRef50_Q9UNP97e-12571.38%Peptidyl-prolyl cis-trans isomerase E n=144 Tax=Opisthokonta RepID=PPIE_HUMAN
NCBI RefSeqXP_001810193.12e-14177.18%PREDICTED: similar to putative peptidyl-prolyl cis-trans isomerase E [Tribolium castaneum]
NCBI nr blastpgi|3504107274e-14280.81%PREDICTED: LOW QUALITY PROTEIN: peptidyl-prolyl cis-trans isomerase E-like [Bombus impatiens]
NCBI nr blastxgi|3071895731e-13880.81%Peptidyl-prolyl cis-trans isomerase E [Camponotus floridanus]
Group
Gene OntologyGO:00064571.2e-221protein folding
GO:00037551.2e-221peptidyl-prolyl cis-trans isomerase activity
GO:00037231.2e-221RNA binding
GO:00036761.1e-25nucleic acid binding
GO:00001662.3e-25nucleotide binding
KEGG pathwaytca:1001423186e-141 
 K09564 (PPIE)maps-> Spliceosome
InterPro domain[1-303] IPR0163041.2e-221Peptidyl-prolyl cis-trans isomerase E
[138-301] IPR0021305.4e-73Peptidyl-prolyl cis-trans isomerase, cyclophilin-type
[139-302] IPR0158911e-72Cyclophilin-like
[10-83] IPR0005041.1e-25RNA recognition motif domain
[4-93] IPR0126772.3e-25Nucleotide-binding, alpha-beta plait
Orthology groupMCL12666 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208754-TA
ATGGCCAGCAAACCTAATTCAAAGCGAACTATTTATGTTGGTGGTTTAGCCGAAGAAGTCGATGAAAAAATTTTAAATGCAGCTTTCATACCGTTTGGAGATCTTGTGGATGTCCAAATTCCTCTGGATTATGAGTCAGAAAAGCATAGAGGGTTTGCATTTATTGAGTTTGAAAACGCAGAGGATGCAGCGGCTGCAATCGACAACATGAATGATTCAGAACTATTTGGTCGAACAATCAGAGTCAACATAGCAGCACCTCAACGCATCAAAGAAGGATCCACTCGTCCTGTGTGGTCGGAAGACAGCTGGCTGCAGAAACATGCTGGTGGAACATTAAATGTGGATAAAGAAGATTCAAATAATGCCAAGGAAAAGACTGAGGCTTCAAATGAGCCGGCTAAACCGATAGAAAAGCGCAACCCTCAGGTATACTTCGACATAAGTGTGGGAAAGCAGGAGATCGGAAGAATCATAATGATGCTCAGAGCTGACGTAGTACCAAAAACCGCGGAGAACTTTAAGGCTCTGTGCACACATGAAAAAGGTTTCGGCTACCAGGGCAGCAGTTTTCATAGGATCATACCAGACTTTATGTGTCAAGGCGGTGATTTTACGAACAACAATGGAACAGGTGGCAAATCGATATACGGAAGGAAGTTTGAAGATGAGAATTTTACATTAAAACACACTGGTCCCGGTATACTTAGTATGGCCAATTCTGGACCTAACACCAACGGCTCTCAGTTCTTCCTTTGTACTGCCAAGACCGATTGGTTGGATGGGAAGCATGTTGTTTTCGGTCACGTATTATCTGGACTCGACGTGCTGAAAAAGATGGAGCGCTATGGGAGCAAGACTGGCGCTCCCTCAGCTAAGGTCGTCATAAGTAACTGTGGTGAATTACAATGA

Protein sequence:

>DPOGS208754-PA
MASKPNSKRTIYVGGLAEEVDEKILNAAFIPFGDLVDVQIPLDYESEKHRGFAFIEFENAEDAAAAIDNMNDSELFGRTIRVNIAAPQRIKEGSTRPVWSEDSWLQKHAGGTLNVDKEDSNNAKEKTEASNEPAKPIEKRNPQVYFDISVGKQEIGRIIMMLRADVVPKTAENFKALCTHEKGFGYQGSSFHRIIPDFMCQGGDFTNNNGTGGKSIYGRKFEDENFTLKHTGPGILSMANSGPNTNGSQFFLCTAKTDWLDGKHVVFGHVLSGLDVLKKMERYGSKTGAPSAKVVISNCGELQ-