Monarch geneset OGS2.0

DPOGS208753
TranscriptDPOGS208753-TA912 bp
ProteinDPOGS208753-PA303 aa
Genomic positionDPSCF300043 + 611742-614242
RNAseq coverage3x (Rank: top 90%)
Annotation
HeliconiusHMEL0058141e-7074.38% 
BombyxBGIBMGA003414-TA4e-9984.65% 
Drosophilacyp33-PA2e-12768.79% 
EBI UniRef50UniRef50_Q9UNP99e-12571.04%Peptidyl-prolyl cis-trans isomerase E n=144 Tax=Opisthokonta RepID=PPIE_HUMAN
NCBI RefSeqXP_001810193.13e-14176.85%PREDICTED: similar to putative peptidyl-prolyl cis-trans isomerase E [Tribolium castaneum]
NCBI nr blastpgi|3504107272e-14281.14%PREDICTED: LOW QUALITY PROTEIN: peptidyl-prolyl cis-trans isomerase E-like [Bombus impatiens]
NCBI nr blastxgi|3071895738e-13981.14%Peptidyl-prolyl cis-trans isomerase E [Camponotus floridanus]
Group
Gene OntologyGO:00064571.9e-221protein folding
GO:00037551.9e-221peptidyl-prolyl cis-trans isomerase activity
GO:00037231.9e-221RNA binding
GO:00036761.1e-25nucleic acid binding
GO:00001662.3e-25nucleotide binding
KEGG pathwaytca:1001423188e-141 
 K09564 (PPIE)maps-> Spliceosome
InterPro domain[1-303] IPR0163041.9e-221Peptidyl-prolyl cis-trans isomerase E
[138-301] IPR0021309.8e-73Peptidyl-prolyl cis-trans isomerase, cyclophilin-type
[139-302] IPR0158911.3e-72Cyclophilin-like
[10-83] IPR0005041.1e-25RNA recognition motif domain
[4-93] IPR0126772.3e-25Nucleotide-binding, alpha-beta plait
Orthology groupMCL12666 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208753-TA
ATGGCCAGCAAACCTAATTCAAAGCGAACTATTTATGTTGGTGGTTTAGCCGAAGAAGTCGATGAAAAAATTTTAAATGCAGCTTTCATACCGTTTGGAGATCTTGTTGATGTCCAAATTCCTCTGGATTATGAGTCAGAAAAGCATAGAGGGTTTGCATTTATTGAGTTTGAAAACGCAGAGGATGCAGCGGCTGCAATCGACAACATGAATGATTCAGAACTATTTGGTCGAACAATCAGAGTCAACATAGCAGCACCTCAACGTATCAAAGAAGGATCCACTCGTCCCGTGTGGTCGGAAGACAGCTGGCTGCAGAAACATGCCGGTGGAACATTAAATGTGGATAAGGAAGATTCAAATAATGCCAAGGAAAAGACTGAGGCTTCAAATGAGCCGGCTAAACCGATCGAAAAGCGTAACCCTCAGGTATACTTCGACATAAGTGTTGGAAAGCAGGAGATCGGAAGAATCATAATGATGCTCAGGGCTGACATAGTACCAAAAACCGCGGAGAACTTTAAGGCTCTGTGCACACATGAAAAAGGTTTCGGTTACCAGGGCAGCAGTTTTCATAGGATCATACCAGACTTTATGTGTCAAGGCGGTGATTTTACGAACAACAATGGAACAGGTGGCAAATCGATATATGGAAGGAAGTTTGAAGATGAGAATTTCACACTAAAACACACAGGACCTGGTATACTTAGTATGGCTAATTCTGGACCTAACACCAACGGCTCTCAGTTCTTCCTTTGTACTGCCAAGACCGATTGGTTGGATGGGAAGCATGTTGTTTTCGGTCACGTATTATCTGGACTCGACGTGCTGAAAAAGATGGAGCGCTATGGGAGCAAGACTGGCGCTCCCTCAGCTAAGGTCGTCATAAGTAACTGTGGTGAATTACAATGA

Protein sequence:

>DPOGS208753-PA
MASKPNSKRTIYVGGLAEEVDEKILNAAFIPFGDLVDVQIPLDYESEKHRGFAFIEFENAEDAAAAIDNMNDSELFGRTIRVNIAAPQRIKEGSTRPVWSEDSWLQKHAGGTLNVDKEDSNNAKEKTEASNEPAKPIEKRNPQVYFDISVGKQEIGRIIMMLRADIVPKTAENFKALCTHEKGFGYQGSSFHRIIPDFMCQGGDFTNNNGTGGKSIYGRKFEDENFTLKHTGPGILSMANSGPNTNGSQFFLCTAKTDWLDGKHVVFGHVLSGLDVLKKMERYGSKTGAPSAKVVISNCGELQ-