DPGLEAN21843 in OGS1.0

New model in OGS2.0  DPOGS206220
Genomic Position  scaffold2656:- 4789-10395
CDS Length  2280
Paired RNAseq reads  1089
Single RNAseq reads  3446
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA009740 (0.0)
Best Drosophila hit  CG10907 (1e-74)
Best Human hit  peptidyl-prolyl cis-trans isomerase CWC27 homolog (2e-76)
Best NR hit (blastp)  PREDICTED: serologically defined colon cancer antigen 10 [Taeniopygia guttata] (4e-98)
Best NR hit (blastx)  PREDICTED: similar to CG10907-PA [Apis mellifera] (4e-83)
GeneOntology terms
GO:0016853 isomerase activity
GO:0006457 protein folding
GO:0003755 peptidyl-prolyl cis-trans isomerase activity
GO:0003674 molecular_function
GO:0008150 biological_process
GO:0005634 nucleus
InterPro families
IPR015891 Cyclophilin-like
IPR009450 Phosphatidylinositol N-acetylglucosaminyltransferase
IPR002130 Peptidyl-prolyl cis-trans isomerase, cyclophilin-type
IPR020892 Peptidyl-prolyl cis-trans isomerase, cyclophilin-type, conserved site
Orthology group  MCL13854

Nucleotide sequence:

ATGAGTAACATATACATTCAAGAACCGCCGGCATTAGGCAAGGTGCTGTTGAAGACCTCA
GCCGGTGACATAGACATTGAACTATGGACCAAGGAGACGCCTAAAGCTTGCCGTAACTTT
ATACAGTTATGTATGGAAGGATACTATAATGGTACAATATTCCATCGAGTGGTACCGGGA
TTCATAGTGCAGGGCGGTGATCCCAACGGGGATGGAACGGGTGGTGAATCAATATACGGA
GAACCATTTAAGGATGAGTTCCATTCTCGTCTGCGGTTCAACCGTCGAGGTCTGGTCGCG
ATGGCCAACGCTGGTAAGGACGACAACGGCTCCCAGTTCTTCATAACCCTGGGCTCCACT
CCCGAGCTGCAAAACAAACACACCATCTTTGGAAAGGTTACCGGTGATACAATATACAAC
GTCCTCAAACTGACGGAGGGTTTGATCACCGACGACAGACCTGATCATCCTCACCGCATC
ATCACGACTACAGTCATCATCAACCCCTTCACCGACATCACGCCTAGAGTGAAGGAAGTC
GTGCCAGAAGAGAAGAAGACGAAGAAGAAAGAGATACAGGGTGTGAAGAACTTGGGTCTT
CTGTCTTTCGGTGAGGACGCTGAGGAAGACGAGGATGAAGCCCAGGAGTACAGAGGGAAA
CCCAAGTCCACACACGACCTCCTACAAGATCCTACACTCATTAGCAAAACGGCTGCCGAA
CTGGAATTAGAAAATGACGTGAAAGACGATCAGAAGACAGAACAGAAGAAAGAACATACA
CTGTCAGCGGTCAGCAGCATAAGAGATAAGTTGAAGAAGAGAGAAAGAGATGGAAATACG
AATGAGAGGGAGAACAAGAAGGACGGCAGTCCGGAGGAAAAGAAGATGAGGGAAGAGAGC
GAGTCAGAGGAAGAAGAAGAGTATTATCTGGGGAAGGAAAGAGATATGGAGAGAAGAAAG
GAAACAGATCGCATACGTAACGAAATCAGACAGCTGAAGAAGGAAATGAGGGGTCCTAAA
GAAGTTAAAGAAGAAGTAAAAGAAAAAGAAACCAAGAAGACTGTAGAGGACAACGAAATG
TACAAAGAGTTTGTGGAAGAACAGGAGAAATACAAGAAAATGAAAGAGAAGATTCCTAAG
AAGGGAGCTGCCAGAGGACGAAGAACGAAATTCACGAGAAATACAACCAGAAGGAAAGTT
TGGGTCAAGAACTTGTACGAGAACCGCGACTTCCCAGACAATTACACGGACTCCAAGTTC
CTAGAAGAGTTACAGAAGAATCTCTTTATTGAAAAAGTGTCTCTGTCGCAAGCGGTCCAA
GGTTCGTTCAGGGTCGTGCTAAGGATGTGCCTGTGTGTTCTGTTCGGCGTGCTGTTCGTG
CATATGCATGACAAGCGGATCCACACGCACACAGTCATGTACGTATCCACGTCCGTCACG
TGTGGGTGCTACGTGATGTACGTGTGGGTGGAGGGTTGCAGGCTCTTGAGGCACCTGAAG
ATCGTCCTCATATACATAGTGCTAGGGTACATCCTGTCTCCCGTGCTGCACACGCTGACG
GACACAGTCAGCACCGACACCATACACGCGTGGTCCGTGTGCATGCTGGTCGTCCACCTC
ATCTTCTTCGACTACGGCGTGTCCTCCGCCTTCGTCTCCAACTCTCTGTCTATCAACGCC
GCCATCTTCTCGTCTGTGTGTCTAGTCTCGAGACTATCAACAGCATTCGACGCGTTCGTT
CTTCTGACTATCTCGGTTATCTTCTTTGTGCTGAGCCCTCAGTTGTTCTCCGTGTTCCTC
GGCTCTAGATTCTTCTTGTTTCTCTTCTCCATCACTCTGTTGATGACTGCTGTATCTCTG
TACACGGTGTCGTCTAGCCTGTTGTTGTATTTTGTGTTCTTAGTCCTTGTTGTTAGTGGC
TGGTGTCCCTTGATGTTCGTGAGGTGGCAGAAATATAAAGACAACATCCACGGGCCCTGG
GATGAGGCGATAATTCATAATTCAGATGAATTCGATGAAGAGGACTTCACGCTGCAGCTG
CTGGCCAAGTTCAAGACCAAGCTGCACGACATCAAGGAGAGGAGAAATGACGGTGACGTC
ACCGACGACGATGACGTCACCGATGATAAGTGGCTCGGTCACAGGTTACATTTTGAAGAC
AAAGGCGCCGTTCTGGCGAAGGACGCGTCCAGCAAGGGCGACGACTGGTTCGACATCTAC
GACCCGAGGAACCCCATCAACAAGAGGAAGAGAGAGAAGGACAGGAAGAAGAATAAGTAG

Protein sequence:

MSNIYIQEPPALGKVLLKTSAGDIDIELWTKETPKACRNFIQLCMEGYYNGTIFHRVVPG
FIVQGGDPNGDGTGGESIYGEPFKDEFHSRLRFNRRGLVAMANAGKDDNGSQFFITLGST
PELQNKHTIFGKVTGDTIYNVLKLTEGLITDDRPDHPHRIITTTVIINPFTDITPRVKEV
VPEEKKTKKKEIQGVKNLGLLSFGEDAEEDEDEAQEYRGKPKSTHDLLQDPTLISKTAAE
LELENDVKDDQKTEQKKEHTLSAVSSIRDKLKKRERDGNTNERENKKDGSPEEKKMREES
ESEEEEEYYLGKERDMERRKETDRIRNEIRQLKKEMRGPKEVKEEVKEKETKKTVEDNEM
YKEFVEEQEKYKKMKEKIPKKGAARGRRTKFTRNTTRRKVWVKNLYENRDFPDNYTDSKF
LEELQKNLFIEKVSLSQAVQGSFRVVLRMCLCVLFGVLFVHMHDKRIHTHTVMYVSTSVT
CGCYVMYVWVEGCRLLRHLKIVLIYIVLGYILSPVLHTLTDTVSTDTIHAWSVCMLVVHL
IFFDYGVSSAFVSNSLSINAAIFSSVCLVSRLSTAFDAFVLLTISVIFFVLSPQLFSVFL
GSRFFLFLFSITLLMTAVSLYTVSSSLLLYFVFLVLVVSGWCPLMFVRWQKYKDNIHGPW
DEAIIHNSDEFDEEDFTLQLLAKFKTKLHDIKERRNDGDVTDDDDVTDDKWLGHRLHFED
KGAVLAKDASSKGDDWFDIYDPRNPINKRKREKDRKKNK