DPGLEAN02084 in OGS1.0

New model in OGS2.0  DPOGS209887
Genomic Position  scaffold82:- 94485-101025
CDS Length  2088
Paired RNAseq reads  383
Single RNAseq reads  1121
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA004209 (2e-149)
Best Drosophila hit  CG10621 (3e-32)
Best Human hit  translin-associated factor X-interacting protein 1 (6e-39)
Best NR hit (blastp)  PREDICTED: translin-associated factor X interacting protein 1 isoform 4 [Pan troglodytes] (1e-54)
Best NR hit (blastx)  hypothetical protein BRAFLDRAFT_117631 [Branchiostoma floridae] (1e-49)
Gene Ontology terms
GO:0030154 cell differentiation
GO:0007275 multicellular organismal development
GO:0005515 protein binding
GO:0048471 perinuclear region of cytoplasm
GO:0005737 cytoplasm
GO:0007283 spermatogenesis
InterPro families  IPR003726 Homocysteine S-methyltransferase
Orthology group  MCL21865

Nucleotide sequence:

ATGGAGAAATGGCACAGACCCCGTATTGAGGCTTTAATAGATGCCGGTGTAGATATTTTA
GCTTTAGAAACAATGCCATGTGGAAAAGAAGCTGAAATGCTTGCAAGTATGATAAAAAAT
TATGCTCAGATACCAGCCTGGATAACATTCAGTTGTAAAGATGATAGAAGTCTAGTTGAC
GGAGAGGATTTTCAAACAGTGGCACAAAGATGCTGGGAAATAAATCCAGAACAACTCATT
GGTATTGGAGTGAATTGTTGCTCACCTAAGGTTGTTGGGAATCTTTTTAAAGACATCAGT
AAGGGGATAGAGCCACCACTGAGTTTGGTTACATATCCAAATTCGGGAGAAAAATACACA
GAGGAAGGGTGGGGGGAGAGAGACTGTGATTCATTGGACACATATGTGCATGATTGGTTG
GATCTTAATGTGAAGTTTGTGGGAGGTTGCTGCAGGACCTACGCAGAAGATATTGCCCAA
ATAAGAAAGAAAAAGGTATTCGGTATTTCAATGAGAAAAAAACAAACTGATAACAAGGTA
AAAATGTTGTTACAAGAGAACAATAACTTGAAGGAAGAGAATCTTAAGCTGTCCTTGAAA
ATCTCAAGGTTAAAAAGAGAAATCGGAAAGATTAAAGACGACAAGTTTTCGGACTATTTG
TGTCTGATGCGCGAGCGTGACGCTCGATATACTTTGTACTTCGAGAACGTGGCTCTTCAT
CTGAAGCTGAGAGAGTTGGACGGAACTTCCAGATGTTACTCGGTCGACGACTCCTATGGT
GATCCTGTCATACTTAAAATTGCCTTAGAAAAGTGTCGGGAACAGTTATCTGCTGCGCAA
AATAATTTGAAAAGAATGAAAGAAGAGTACTCAGAGACCATACCGAGACGGGAACATGAT
TTACTTGACGCTAAATGTTACGAACTCACCAAGGAAGTGAACACTTTGAAAACAGAATAC
GATATTGTGCAAAATTCATATAAACGAGTCTTAGCCCAGAAGAAGAGCCTGGAGGAGGAG
TTACGTGAGGTGAAGGAGCGATGTAGTGAGCTAGAGCGATCTGGTACTCCTCGTCCATGC
TGGGAACTGTGCGGAGACTTCATCAGTGGCGGCCGGGATCGATGGTGGCAGTTAGCAAAC
GGTTTATCATCGAGGGACATGCTGAGGGTTCTGTTAAAGGAACTCGGGCCTGCAGCCGAG
AGCGAACATCTAGAATACTTCGATGGTTTGGGAACCGACCCAGCGATCCCTCCATACCTC
CGTCATGAAGGTCGCGTCCGCAATCTTCGTCTTTCCCGGCGTGAGGTGCGAGTCCTCGTT
AACGACATCTGGATATCTCGCGCCACTCACAGAGATCTTCCTCTACAGGATTACGTCACA
CGATACTTTGAAGAGAGGTACCAACAGCCATCAGTTCGTGCCGAGTGGTCGTACAACATG
TGCGCGGGTCTCGAGCAGATGATGGACGAACCGAGCGTGAAGCTGTTCTGGGGCGCGCTG
CTCGGACAGCTCAGCGAGGACGTGTACTGGCGGCTGAGGGACGCCTGGCACACGCTCAGA
GACAGACTCTACACACACAACGGTGGACAGGAGACCCTAAGTATTGAGGAATTGGAGAAG
GTGACGCGGACGACCTTCCCCTTGAAGAATGAAGTTGACATAAAGAACTTGATGGAAGTC
GTCAAGAAACAACTCAAACTGAAGATAAACGTCGTTGAGGTGAACTTGGATAAATTGTTC
TTTGAGAATGAAGAAGGTTTCGACCGATCGGAACTAGCGCGGGAATTATACAGGCAGAGA
CAACTGGCCCAAGATAAATACATACGAGAAGTGATCGCTGAACTCGGGAGGAGGAACCTT
AACAAGACCATTAGTGTGGACTGTCTTAAACGAGCCTTCGCCATCGTCGACCCGGCCATA
GATCACATCCGTATGGAACGCTACATCCGCTGGGCCTTCTCAGACCAAAGTTCTGAACTA
TCAGCCATTTGTCCTTTGCCGCTACGCACTATCGTATCCAGACTCGCATCGGGAGACATA
GAACGTATTGGACCGAGACATAAAGGATTTCGTAGATCTTATAAATAA

Protein sequence:

MEKWHRPRIEALIDAGVDILALETMPCGKEAEMLASMIKNYAQIPAWITFSCKDDRSLVD
GEDFQTVAQRCWEINPEQLIGIGVNCCSPKVVGNLFKDISKGIEPPLSLVTYPNSGEKYT
EEGWGERDCDSLDTYVHDWLDLNVKFVGGCCRTYAEDIAQIRKKKVFGISMRKKQTDNKV
KMLLQENNNLKEENLKLSLKISRLKREIGKIKDDKFSDYLCLMRERDARYTLYFENVALH
LKLRELDGTSRCYSVDDSYGDPVILKIALEKCREQLSAAQNNLKRMKEEYSETIPRREHD
LLDAKCYELTKEVNTLKTEYDIVQNSYKRVLAQKKSLEEELREVKERCSELERSGTPRPC
WELCGDFISGGRDRWWQLANGLSSRDMLRVLLKELGPAAESEHLEYFDGLGTDPAIPPYL
RHEGRVRNLRLSRREVRVLVNDIWISRATHRDLPLQDYVTRYFEERYQQPSVRAEWSYNM
CAGLEQMMDEPSVKLFWGALLGQLSEDVYWRLRDAWHTLRDRLYTHNGGQETLSIEELEK
VTRTTFPLKNEVDIKNLMEVVKKQLKLKINVVEVNLDKLFFENEEGFDRSELARELYRQR
QLAQDKYIREVIAELGRRNLNKTISVDCLKRAFAIVDPAIDHIRMERYIRWAFSDQSSEL
SAICPLPLRTIVSRLASGDIERIGPRHKGFRRSYK