DPGLEAN07999 in OGS1.0

New model in OGS2.0  DPOGS200725
Genomic Position  scaffold402:- 17021-25057
CDS Length  3429
Paired RNAseq reads  2471
Single RNAseq reads  6831
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA001123 (1e-48)
Best Drosophila hit  Ranbp21 (0.0)
Best Human hit  exportin-5 (8e-120)
Best NR hit (blastp)  AGAP011888-PA [Anopheles gambiae str. PEST] (0.0)
Best NR hit (blastx)  PREDICTED: similar to chromosome region maintenance protein 5/exportin [Tribolium castaneum] (0.0)
Gene Ontology terms
GO:0008536 Ran GTPase binding
GO:0005488 binding
GO:0006886 intracellular protein transport
GO:0008565 protein transporter activity
InterPro families
IPR016024 Armadillo-type fold
IPR013598 Exportin-1/Importin-beta-like
IPR001494 Importin-beta, N-terminal
IPR011989 Armadillo-like helical
Orthology group  MCL13543

Nucleotide sequence:

ATGAACTCAGTAGGATTTGGAGAAAGGGAAGTGGCTTTAATAGCTGATGAATTAAGTCAT
GCTGTAGAGCTCACCCTTAATCCGACAGCCAATCAAGACGCCAGGAAACAGGCCTACACA
GTATGTGAAAGCTTCAAAGAAAATTCACCATGGTGTGCCCAAGCAGGTTTACTTCTGGCA
AGTGGAACTCAATACTCACCAGTTGTAAAGCATTTTGGTCTTCAACTCTTGGAGCACACC
GTTAAATATAGATGGACCCAGATTACACAACCAGAAAAGATATTTATTAAGGAAAATTCC
ATGAAACTTCTCTCAATGGGCGGCTGGGAGACAGGTCACTTGAATGATGCATTAGCTCGT
GTGATAGTCGAGATGATAAAGAGGGAATGGCCACAACAATGGCCAACACTTCTAGCTGAA
CTTAGTGACGCCTGTACTAGAGGACATTTGCACACGCAAATAGTCTTACATGTATTCCTG
AGGCTAGTTGAGGATGTAGCAACATTACAGACTTTAGAACAACACCAACGTCGTAAGGAT
ATATACCAGGCTCTGACGAGCAATATGGCGGAAATATTCTCATTCTTCATGAGACTCATA
GAACTACACGTACAAGAGTTTAGAGAGAAGACTGCAGCTGGTGATTATGCTGCAGCGGCA
AGCAATGGCAGGGTTGTTCAGGTTGTACTACTGACTCTCACAGGATTTGTGGAATGGGTT
TCGACAAACCATGTAGTGACAAATAATGGAAGGTTACTACAAATCTTATGTATATTGCTC
AGCGATGATGTCTTCCAACTACCAGCTGCTGAATGCTTACTACAAATTGTTAATAGAAAA
GGCTCGGTTCTCGCTGGTCTTGCGCAACAACTGACCTCAATGTGGAACTGTGCCACAAAC
GTGGACTCGTGGTTGCCCCTTCTCCTTGAGACGATGCTGCTTCTAACGTCACACCCCTCG
CACACATTGGCTCACACAGCTAACTCAGTGTGGCTCGCCTTCCTCAAACATGATCAGATA
TCAAAACTGCCACATGTACTGGCCATGGTGCCGAGGTGGTTACAAGCGGCCACACCTAAG
ATATTGAAGGTGTGGTCCCCTCGTGAATCTGTCGGCTGTGCGTGTGACTACGACAGCGAT
CACGAGTTCGCAGTGTTCTTCAATCGCGCAAGAACTGAGATGCTAGAGAGCTTCAGATAC
TGTATGACCGCAGCGCCTTTGGTGACTTGGTCTTATGTTGAGCAATGGACGAACACGGCG
TTAGACAAGGTGGACTCGTGTCCGCTACAGTGTGACGTCATGCACCCGCTACACGTGGAG
TGGGAGGCTCTCGCACAGGTCCTGGACGTGGTCCTATCTCGGCTGCTCCTGGCGGAGCCT
CGGCCCAATGTGTCGGAGGGTCTTCGCGTCCTCCAGCGCTGTGTGTCTTGTGAGCCCCGG
GCGCCTCTCATTCTCTCGCTGCTGTTGTCTTTCATATCAGCCCTGTTTGTGTTCCTGAGC
TGCGCGTACAGTCAGATGGCAGGTCCGGGGGTCGGGTCAGCGGGCGCGGAGCTTCTACCA
CGTGTTTTGGACAAGATATTCGCGTCGCTCGTGTACGAGGGGACGCCGCCTGAAGATCGT
AGTTCACGGAACGTAAAGAACGTGAGGCGACACGCCGCGGGGTTGTTGGTCAAACTCGGT
TCCAAGTACCCGCTGCTGCTGCTGCCGGTGTTCGGTCGTCTCCACGAGCTGACCCGCGCC
GCGCTCTCCCGCCCCGACCTCAGCGCCATGGAGAGCGTCACGCTGCAGGAGGCACTGCTG
TTGGTCTCCAACCATTTCTGTTGCTACGAACGACAGAGTGCTCTTGTCGCTCAAGTGATG
GGCGACTGTCGCGACCGGTGGGCCGCTTTAGCCGAACACATCCAGTCCGCGGCGGGGCTG
GCGCGGCTCATAGGGCTGGACGCGCCGCCCACGGACGACCCTGAGCGCGCGACCGCTCGC
CGGACCTTACTTCACGCCCTCACCCTCACGCTCGGTGTCGTAAAAAGAACTCAAGTGCCG
GCCGATCCGGACAAGGCGGCTCGCGGGGGCTTTGGCGCTGGCGTGACGGCTTCGGGGAAT
CCAGTGTGGAGGAACCCGTGCGGGGTGCACGTGCTGCCTCTGTTCCCCGGAGTGCTGGCG
TTGGCTCGCTCGCTGCACGAGCTGCACTCGCCTCACGCGGCCTCCCTGCGGCACGCGGGT
CACGCCCGGGCGTTGTTGCCGCCGCCGGCGGAGCGCCGCAACCTCCTAGGGCTGCGGGAC
GACGCGCCGTCCTCGCCCACCGGACAGCCGGCCTCACCGACCTCACCACAGGACCGTATG
CAGAATCTTCTTCATACGCTACATGACAATGTGTGTCAGCTGGTGGGCGCGGCGGCCTCG
TCTCTGGGTCGTGAGTTGTACAGCTGTGCAGGTCTGGCGCGCGTGGTGGCGGGGTCTCTG
CTGGGGGCTCCGGAGCGTCTGCCCGAGCACTGGCTCCGCCGGGTGCTGCGAGCCGCTCTC
AGACCACTCCTCCTGCACTGCCCGCCGGCTCACTACAGGGACGTGGCGCTGCCGCTGTTA
CAACATCTAGCTCCTATCATGTTGGTACATTTAAATCAGCGCTGGGACTACGTGACGTCG
TTATATGAATCAGGGAAGTTAGAAGAAGAGGGGGGCAGTGAGTCACAGGAGGTGCTGGAA
GATATGCTGGTTAGACATCTCACTAGGGAACACCTTGATCTACTGAAGGTTTGTTTAGTG
GAGGGCGGTGTCACCGCTGAGAGTACGGATATGGAGGATGAAACCGTAACACCAGTGAAT
CCTGGCCGCGCTCCGGAACAAGTTTCCGAGTTAGGAGGGATCATTATATCGGATCCCATA
GCGGGACCCGCGGTCTTACACACAATTCTCCGCGCCCTCACTTGGAACGATTCCACTTCG
TCTCTTCGCGCGTGTTCGCTGGCACTACCCGCTCTACGAGCGGCCTTGGCCGCTCTAGGC
GCCGGCGAGGCTGCAGGCGCTTTGGCCGCCGTTCTACACGCGCTCAGACTACACGGACAG
CATGAAGCGAATCAGGCTGCTCTACTGGCCCTCGCTGTCCAGACCTATGAGGCGCTCCGC
CCCGTGTACCCCGAGGTGTCGTCAGTGTTGTGTGCTATCCCCGGAGTGGAGCCGGTGGAC
GTGCAACGCCTGGACGACAAACTGGCCGCTCCCCCTCTCAGGCCCGCCAAGGTCGACAAG
AGCAAGAGGGACCTGTTCAAGAAGATCGCCTCCAGGTTGATAGGAAGAAACGTCGGACAA
CTGTTCAAGAAGGAGGTCTTCATACTGGATCTACCTACGATGAACATACAGAGAGACAAA
CCTCGCAGCGCCTTGGACACCGACGGGGCGGGACTAGAAAATCTTTTCAATACAAACGCT
CCAACCTAG

Protein sequence:

MNSVGFGEREVALIADELSHAVELTLNPTANQDARKQAYTVCESFKENSPWCAQAGLLLA
SGTQYSPVVKHFGLQLLEHTVKYRWTQITQPEKIFIKENSMKLLSMGGWETGHLNDALAR
VIVEMIKREWPQQWPTLLAELSDACTRGHLHTQIVLHVFLRLVEDVATLQTLEQHQRRKD
IYQALTSNMAEIFSFFMRLIELHVQEFREKTAAGDYAAAASNGRVVQVVLLTLTGFVEWV
STNHVVTNNGRLLQILCILLSDDVFQLPAAECLLQIVNRKGSVLAGLAQQLTSMWNCATN
VDSWLPLLLETMLLLTSHPSHTLAHTANSVWLAFLKHDQISKLPHVLAMVPRWLQAATPK
ILKVWSPRESVGCACDYDSDHEFAVFFNRARTEMLESFRYCMTAAPLVTWSYVEQWTNTA
LDKVDSCPLQCDVMHPLHVEWEALAQVLDVVLSRLLLAEPRPNVSEGLRVLQRCVSCEPR
APLILSLLLSFISALFVFLSCAYSQMAGPGVGSAGAELLPRVLDKIFASLVYEGTPPEDR
SSRNVKNVRRHAAGLLVKLGSKYPLLLLPVFGRLHELTRAALSRPDLSAMESVTLQEALL
LVSNHFCCYERQSALVAQVMGDCRDRWAALAEHIQSAAGLARLIGLDAPPTDDPERATAR
RTLLHALTLTLGVVKRTQVPADPDKAARGGFGAGVTASGNPVWRNPCGVHVLPLFPGVLA
LARSLHELHSPHAASLRHAGHARALLPPPAERRNLLGLRDDAPSSPTGQPASPTSPQDRM
QNLLHTLHDNVCQLVGAAASSLGRELYSCAGLARVVAGSLLGAPERLPEHWLRRVLRAAL
RPLLLHCPPAHYRDVALPLLQHLAPIMLVHLNQRWDYVTSLYESGKLEEEGGSESQEVLE
DMLVRHLTREHLDLLKVCLVEGGVTAESTDMEDETVTPVNPGRAPEQVSELGGIIISDPI
AGPAVLHTILRALTWNDSTSSLRACSLALPALRAALAALGAGEAAGALAAVLHALRLHGQ
HEANQAALLALAVQTYEALRPVYPEVSSVLCAIPGVEPVDVQRLDDKLAAPPLRPAKVDK
SKRDLFKKIASRLIGRNVGQLFKKEVFILDLPTMNIQRDKPRSALDTDGAGLENLFNTNA
PT