DPGLEAN13007 in OGS1.0

New model in OGS2.0  DPOGS204961
Genomic Position  scaffold1156:+ 8396-13381
CDS Length  1878
Paired RNAseq reads  1278
Single RNAseq reads  2898
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA011130 (0.0)
Best Drosophila hit  CG7706 (7e-99)
Best Human hit  kanadaptin (8e-82)
Best NR hit (blastp)  PREDICTED: similar to smad nuclear interacting protein [Tribolium castaneum] (5e-134)
Best NR hit (blastx)  PREDICTED: similar to smad nuclear interacting protein [Tribolium castaneum] (8e-118)
Gene Ontology terms
GO:0005634 nucleus
GO:0005515 protein binding
GO:0005737 cytoplasm
GO:0005622 intracellular
GO:0003725 double-stranded RNA binding
InterPro families
IPR000253 Forkhead-associated (FHA) domain
IPR008984 SMAD/FHA domain
Orthology group  MCL14002

Nucleotide sequence:

ATGTCTGACAACATAGAAAACCCCGACGTGCCGAGCTCACCCAAAATTGAATTTAAAAAG
CCAATACTTTTCGGAAAGATCGGTAAACTACCAAAAAAGTCGAAAGCGGAACCTAGTGCA
ACAGAGGAAAAGAAAGATGAGGAAAAAAATGAAAATACGACTGAAGGTCACTCTAAAAGT
TCTTTACCGCCAGCAGTTTTGCTAAAAGAATTATCAATTCCTATACCATACAAGGAGCCA
AAATGGTCTGGATTTTGTCCGGAAGGATCGGACTATGCTTTGGAGGTACTGAAATCGGGT
ATGATCATGGAAAAAATCGATCTTACGAAAAAAGCCTTCTATGTATTTGGACGTCTTGCA
AATTGTGATGTTGTTATGGCACACCCGACAATATCCAGACATCACGCTGTTCTCCAATAC
AAGGCCTTCGCTAATGACGACGAGCCAGCATCCGGGTGGTATTTATTCGACCTGGGAAGC
ACCCACGGCACGTTCCTGAACAGGGATAGAATAAAGGAGCAACATTACACGAGGGTCAGG
GTGGGACATCAGATTAAATTTGGTTCTAGCACAAGAACTTACATTGTATTGGGTCCAGAC
TTTGATGCTGACGGTGAATCAGAACTGACAGTCACCGAAATAAGACAAAGGGCGCTCAAC
ATGAAGCTGGAGAGAGACAGAATGATAAGAGAAGCCATAGAGCAGAGGGAGAGGGATAGA
GTGGAGGAAGAAAGGAGGAGGGAGGAACAGGGAATTGACTGGGGGATGGGCGAGGACGCT
GATGATGAACCGGATCTGTCAGAGAACCCATACGCCTGTACAGCAAACGAGGAGTTGTTC
CTGGATGATCCAAAGAAGACACTAAGAGGTTACTTCGAGAGGGAGGGTTTAGAACTGGTG
TACGACTGTGATGAACGAGGAATTGGCCAGTTTCTGTGCAGAGTGGAGCTCCCGCTAGAC
GACGCCAGAGGCAGGCCGCTTGTAGCGGAAGTGCTTCACAAAGGAAAGAAAAAAGAGGCT
GTGGTGGCTTGCGCTCTAGAAGCCTGCAGGATACTGGACCGAGCTGGGTTGCTACGACAA
GCCAAACATGAGTCCCGCCGTAAGAAACAGCGTGACTGGTCGGCGGACGACTACTACGAC
TCCGACGATGACACCTTCTTGGACAGGACCGGGAGTGTGGAGAAGAAGAGACAGGCCAGG
ATGGAGAAGAACGGACTGAAGGACACTGAGAAACCACTCACATACGAGGATCTGCTCAAA
CAGATAACGGACATTGAGAACAAAATAGCATCAGAAGAGAAGATTCTAGAAGCTCTGCGA
GTGAAGAGCAAGCAGAGTGAGCTGGTCGACCACGAAGAGGATGCCTTGGACGAGTTCATG
AATACTCTGCACACGGGACACAGCATGGCTCATAAGGCTGAGATATCCAAAGCCAAGATG
AGCATACAGAAGCTAAAAACCGATCTGTCAAAAACCCGTCGCCTGTGCGAACTGGCTCGC
CCCGCGGACGCTCCTCCCCTCCTCAAGAAGGACAGCACACCCGCCATTAAACAGACACAC
GCAGTCACATACGGCAAGAGGATACGGTTAAAAGACGACAAACCGAAGCCAAAGATCATC
AAACAGAGCAAGCGAGAAGAGGAGTTCGTTGAGGAAATGGACTCCGACGAAGATAGTGAA
TCAAAACCCACACCCATCGTGGAAACTGAAAGCAAATCTGATAGTCCAGTCAGAAGAGAC
AGCGATGGCACCGTGGCTGTGGAGACGAAGAAATTGTATGGTCCGATGAGGCCGCCGGAG
AATTATGTTGTACCCGAAAATTATTACGACGAAGCAACTGACAGGGACCTGCCGGAAATA
GAAGAAGGAGTTGAATAA

Protein sequence:

MSDNIENPDVPSSPKIEFKKPILFGKIGKLPKKSKAEPSATEEKKDEEKNENTTEGHSKS
SLPPAVLLKELSIPIPYKEPKWSGFCPEGSDYALEVLKSGMIMEKIDLTKKAFYVFGRLA
NCDVVMAHPTISRHHAVLQYKAFANDDEPASGWYLFDLGSTHGTFLNRDRIKEQHYTRVR
VGHQIKFGSSTRTYIVLGPDFDADGESELTVTEIRQRALNMKLERDRMIREAIEQRERDR
VEEERRREEQGIDWGMGEDADDEPDLSENPYACTANEELFLDDPKKTLRGYFEREGLELV
YDCDERGIGQFLCRVELPLDDARGRPLVAEVLHKGKKKEAVVACALEACRILDRAGLLRQ
AKHESRRKKQRDWSADDYYDSDDDTFLDRTGSVEKKRQARMEKNGLKDTEKPLTYEDLLK
QITDIENKIASEEKILEALRVKSKQSELVDHEEDALDEFMNTLHTGHSMAHKAEISKAKM
SIQKLKTDLSKTRRLCELARPADAPPLLKKDSTPAIKQTHAVTYGKRIRLKDDKPKPKII
KQSKREEEFVEEMDSDEDSESKPTPIVETESKSDSPVRRDSDGTVAVETKKLYGPMRPPE
NYVVPENYYDEATDRDLPEIEEGVE
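
Consistency check (illustrative sketch):

The CDS length and the two sequences above can be cross-checked by translating the nucleotide sequence and comparing it with the listed protein. The sketch below assumes the standard genetic code applies to this nuclear gene model. The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any database tool. Only the first and last lines of the nucleotide sequence are used to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumed to apply here).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate an in-frame coding sequence; '*' marks a stop codon."""
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First and last lines of the nucleotide sequence listed in this record.
first_60_nt = "ATGTCTGACAACATAGAAAACCCCGACGTGCCGAGCTCACCCAAAATTGAATTTAAAAAG"
last_18_nt = "GAAGAAGGAGTTGAATAA"

print(translate(first_60_nt))  # MSDNIENPDVPSSPKIEFKK
print(translate(last_18_nt))   # EEGVE*

# A CDS of 1878 nt holds 626 codons, i.e. 625 residues plus the stop codon.
cds_length = 1878
expected_protein_length = cds_length // 3 - 1
print(expected_protein_length)  # 625
```

The translated fragments match the start and end of the protein sequence listed above, and 625 residues agrees with the protein as listed (ten lines of 60 residues plus a final line of 25).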