DPGLEAN03341 in OGS1.0

New model in OGS2.0  DPOGS209136
Genomic Position  scaffold2106:+ 5742-11183
CDS Length  2049
Paired RNAseq reads  540
Single RNAseq reads  1578
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA001309 (5e-136)
Best Drosophila hit  CG3803 (3e-117)
Best Human hit  cytochrome c oxidase assembly protein COX15 homolog isoform 1 (4e-97)
Best NR hit (blastp)  AGAP001744-PA [Anopheles gambiae str. PEST] (8e-135)
Best NR hit (blastx)  AGAP001744-PA [Anopheles gambiae str. PEST] (8e-135)
GeneOntology terms
GO:0008535 respiratory chain complex IV assembly
GO:0005743 mitochondrial inner membrane
GO:0003824 catalytic activity
InterPro families
IPR003008 Tubulin/FtsZ, GTPase domain
IPR009003 Peptidase cysteine/serine, trypsin-like
IPR017975 Tubulin, conserved site
IPR000217 Tubulin
IPR002452 Alpha tubulin
IPR003780 Heme A synthase/Protoheme IX farnesyltransferase
Orthology group  MCL13754

Nucleotide sequence:

ATGGTCCCTAGGGTCGTTATGGTTGACTTAGAACCTACACCTATAGATGAGATCAGAACA
GGAGCGTATAGGCAACTGTTTCATCCAACATCATTAATTACTGGAAAAGAAGATGCAGCT
AGTAATTTTGCACGAGGATATTTTGGTGTGGGTAGAGAGATGATAGATATTGCTCTAAAT
CGTGTAAGAATAGCGGCGGAAGACTGCAGTTGCCTCCAAGGTTTTATTATCTTCCGATCT
TTCGGAGGAGGTACAGGATCTGGATTCACTGCACTATTACTAGATAGTCTCACTAAAGAT
TATGGTAAACTTTCTAAAATTGAATACGCTATATATCCATCACCAAAAATATCGCCGGTA
ATAGTAGAGCCGTACAACGCAGTACTGACTGCCCACGCTTGTATGAACACCGAGGACGTA
TGTTTTATTTTCGACAACGAAGCTCTCTATGATATACTAGCAAGGCTTCTGGATGTACCG
AGGCCCACATATACAAATTTAAACAGACTTATCGCACAGATTGGTGTAAATAATCAACCA
CCGACAACCGTCCCTGGGGGCGACTTAGCAGCTCTTCAAAGAGCAGTCGCGATGGTGTCT
AATTCTTCAGCTGTTCGTACCGCTTGGGAACGATTGTGTAAAAAAATGTTGGGTATGGCG
AATTTATGTCGGTACTCTCAACTTGTAAAAGTTGCTCCGACCAAACTGCTAGGATCAAAT
TCGGGTGTTAGCCGCTTAGTTTCAAGGCAGCTCATTACACCGATAAGAAACAGCAACCAC
AAGCACACCATATACAAGGGGTTTCAGATACAGAATATAATAAAATCAAATCCAATAATA
TTAAGATTCTGTTCATCATCACAACCAAAGAGGTCTAAGCTTGTTGGCTACTGGTTACTG
GGATGCAGTGGGATGGTGTTTACTGCTGTTGTTTTAGGCGGAGTGACTCGACTCACTGAG
TCTGGGTTATCTATGGTCACATGGAAATTGTTAGGAGAGAAGTTACCAAGAACTGATGAG
GAGTGGGAGACGGAGTTCAAGAAATATCAGCAGTACCCGGAGTATATATATAAGAATCAT
TCACTGACACTGTCCGAGTTCAAATGGATCTGGTATATGGAGTATGCTCATAGGACGTGG
GGTCGACTCATAGGGGCCTCTGTCTTCATCCCGGCCGCTGTGTTCTGGGCTAAGGGCTGG
TTCGACAAGGCTATGAAGATAAGGGTGTCCGCATACTGCGCGCTCGTTGCTGCACAGGGT
CTTATGGGTTGGTACATGGTGAAGTCAGGTCTTGAAGACAGATTTCAAGGGCCGTCGGAC
GTTCCGCGCGTGTCCCAGTACCGCCTGGCCGCTCATCTCAGTCTCGCCTTCATTCTGTAC
TCGGGGCTACTGGCCGGAGCCCTGCGGGTGCTCCGCCCCTTCCCTAAGGGAGCTCTCGTG
AGGATCAAAGAGCTGGCCGCCGTCACCGGACTCGCGCATGCCGTTAAAGCTATGGCGTTC
TTCACGGCTGTTTCAGGAGCGTTCGTGGCCGGTCTAGACGCGGGATTGGTCTACAATTCA
TTCCCGAAGATGGGTGACAACTGGATCCCGGACGACATCCTGTCCTTCGCCCCCACCATC
AAGAACTTCACGGAGAACCCCACGACAGTTCAATTCGACCATCGGGTCCTTGGCACCAGC
ACATTGATAGCGGCCACCACACTGTGGCTGATGGCGAGGGGCAGGCCACTGTCCCCGGTG
GCGAGGAGGGTGGTCAATGGAGTGGGAGCCATGGCCTGGCTACAGGTGTGCCTGGGTATC
ATGACGTTGGTCCACTACGTGCCCACTCCGCTGGGCGCGTCTCACCAGGCCGGTTCCCTC
GTCCTACTGTCGCTGGCAATCTGGCTCACTCACGAGATCAAGCTACTCAAGTACATACCA
AAGTCGACCGAGATGAGATGCGAGGGATGTGGCACCAACCTTCACTGGTGTAGTGCTGAA
GAGAGCACACGCGTTTGTACAGCAACATATAAAGGATTTAGTGCTAAAGAGTCCCGATAT
TGCGTTTAA

Protein sequence:

MVPRVVMVDLEPTPIDEIRTGAYRQLFHPTSLITGKEDAASNFARGYFGVGREMIDIALN
RVRIAAEDCSCLQGFIIFRSFGGGTGSGFTALLLDSLTKDYGKLSKIEYAIYPSPKISPV
IVEPYNAVLTAHACMNTEDVCFIFDNEALYDILARLLDVPRPTYTNLNRLIAQIGVNNQP
PTTVPGGDLAALQRAVAMVSNSSAVRTAWERLCKKMLGMANLCRYSQLVKVAPTKLLGSN
SGVSRLVSRQLITPIRNSNHKHTIYKGFQIQNIIKSNPIILRFCSSSQPKRSKLVGYWLL
GCSGMVFTAVVLGGVTRLTESGLSMVTWKLLGEKLPRTDEEWETEFKKYQQYPEYIYKNH
SLTLSEFKWIWYMEYAHRTWGRLIGASVFIPAAVFWAKGWFDKAMKIRVSAYCALVAAQG
LMGWYMVKSGLEDRFQGPSDVPRVSQYRLAAHLSLAFILYSGLLAGALRVLRPFPKGALV
RIKELAAVTGLAHAVKAMAFFTAVSGAFVAGLDAGLVYNSFPKMGDNWIPDDILSFAPTI
KNFTENPTTVQFDHRVLGTSTLIAATTLWLMARGRPLSPVARRVVNGVGAMAWLQVCLGI
MTLVHYVPTPLGASHQAGSLVLLSLAIWLTHEIKLLKYIPKSTEMRCEGCGTNLHWCSAE
ESTRVCTATYKGFSAKESRYCV
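
Consistency check (sketch):

The CDS length (2049) and the two sequences above can be cross-checked by translating the nucleotide sequence: 2049 nt corresponds to 683 codons, the last of which is the stop codon. The following is a minimal Python sketch, assuming the standard genetic code. The names `CODON_TABLE` and `translate` are illustrative and not part of the record, and only the first six codons and the last three codons of the nucleotide sequence are used as inputs here.

```python
from itertools import product

# Standard genetic code, built in TCAG order (assumption: no alternative
# translation table applies to this nuclear gene model).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1; '*' marks a stop codon.

    Whitespace and line breaks are ignored, so the sequence can be pasted
    as it appears in the record. Ambiguous bases (e.g. N) raise KeyError.
    """
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First six codons and last three codons of the nucleotide sequence above.
start_peptide = translate("ATGGTCCCTAGGGTCGTT")
end_peptide = translate("TGCGTTTAA")

print(start_peptide)   # start of the translated product
print(end_peptide)     # end of the translated product, including the stop
print(2049 // 3)       # number of codons implied by the listed CDS length
```

Passing the full nucleotide sequence to `translate` and dropping the trailing `*` should give a string that can be compared directly with the protein sequence listed above.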