DPGLEAN14800 in OGS1.0

New model in OGS2.0DPOGS208937 
Genomic Positionscaffold31:+ 63535-70216
See gene structure
CDS Length1947
Paired RNAseq reads  2072
Single RNAseq reads  5297
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA002409 (0.0)
Best Drosophila hit  ND
Best Human hitangiomotin-like protein 1 (6e-26)
Best NR hit (blastp)  angiomotin, putative [Pediculus humanus corporis] (9e-159)
Best NR hit (blastx)  angiomotin, putative [Pediculus humanus corporis] (3e-120)
GeneOntology terms



  
GO:0005515 protein binding
GO:0030054 cell junction
GO:0042802 identical protein binding
GO:0005737 cytoplasm
GO:0005923 tight junction
InterPro families  IPR009114 Angiomotin
Orthology groupMCL17324

Nucleotide sequence:

ATGGGGTCCATGCGACCTAATGGACGTTTCCTTTCATTTGCTAATAACATACAGAGAACT
CAAAAGCAACCTGTCCCAAGTGGATTTCCTCAAAGTCTATCCGGTAGTGAGACAGATGTG
TCCACATCAAATGAGAATCTGTCAAGAGAGGAGAGGTATGTTGTGAGGCACACAGCACGA
GTTGAACCACAAGGACAGGAAAATCAAAGTCAGACTAACAATAATAATAACAACAATAGA
AACACATTGAAGGATAATGTAGGGGGCAGTAACCGCAACTCACTAAAAGACTCAGTAGGT
GGAGGTAACAGCAATCGGAGTTCGCTGGATGTTTCATCATCATCATATAACACTCTGATC
ATTCATAACCAGGACGACTCCTGGTCTTCAAGACCAACACCAATTAGGGAACATGAAAGA
ACAAACAGTGAAGTTAAGCATTCAAATACACAGTCTTCCCCATACCACACTTTAAAAAAG
AGTGATGCAGTGAAGAAGCCTAGCGGGATTCCGCTACCCAAAGTTCACAAAGAGCAGACT
GTTACAGCGAGTGCTAATTATATTGATATTGGAGGTCAGAGGATATATACAAGCCCACCG
GATCAAGGGGTTCAGGAAATAAATGAAATACCGGATGATTTTCTGAATCAGTCATCAGTT
CTGAAACATCTTGCTAAGGAAGTAACCCAATCTCCGACACCTCGAGGGCTCACACCTCCA
GCGTCTCCCCACTCGACTCGAGCTCCCTCGAAACCCCGTGAAGAGAGGAAAGGAAAAGGA
TCGAAAGCTAAACTCAGTAAGGAGAAGTTGAATTTGTCAAGATCACAGCCCGATCTAACA
AGTGTTGGCGTCCGAGCAGTACCAGGTGGATCAGAGTCCAGCGGTTGGTGTAGTGGAGGG
GAGGGTTCTTTGGAGGAGGCTGATGACGCGTTTGCAGCTGTTCTGGACGCTCTTGCAGCT
GAGAACCACGCTCTAAAGAGACAGCTGGCTGACGCGTGCGAGCGAGTCGCTAAGACACAT
AAGTTGGAGCAGGAGGTGGAAAAGGTTCGTACTGCCCACGAGGAGCTCGTGGGCTCGTGC
GAGCGACGGGAGCGGCTGGAGAGAGCCGCTCGGGTCAGGCTGCAAGCTGACTGTAGACGC
CTACACGAGATCAACAGGGCTCTCAAACACCAGACGGAGTTACTGTCATCTGGAGGTCGA
GCGGAGGGCGGCGCTAGTGTGGAGGCTCTGCGGAAAGAACTACAAGGACGGGAGATGCTC
ATAGCACAACTCATTACACAGAATAAGGAGTTGGCTTGCGCTAAAGAGCGTCAAGAGATA
GAGATGTCAGCTCAGCGGGCGACTCTACAGGAACAGAGGACACACATCGACATACTGGAC
ACGGCGCTGACTAACGCTCAGGCTAACGTGGTCAGGCTGGAGGACGAGTGTCGTCACGCG
AGTGGGTACGTGGAGCGCGTGCTGGGTCTGCAGAGGGCGCTGGCGTCGCTGCAGCAGGCC
TCGGACAGGAGAGAACACACGGAGAGGAAACTCAGGGCGCAGCTCGAGACAGAACTACAG
GCTCTCAGGAAACGTGAGTGTGTGTGTGGCGGTGTGGATACCTCCGGTGTGAGTGGTGGT
GGGGGCGGCGGGGGAGGGGGCGCCGCGTGTGGGGGGGAAGCGGGGGCGGAGGCCGAGCTC
AGGCGGGCGCTGCGGTCGAGGGACGAGAGGCTGCTGGCTCTAGAGGGGGAGTGCGCCAAG
TGGGAACAGCGCTACCTCGAGGAGGCCGCACTCAGACAGGCGGCGGTGTCCGCAGCATCC
ATACCCAAGGACGCTAAGATCGCGGCCCTGGAGAAGACGTCGGCGGAGTCCGAGCGACTG
ATGGCAGAGGCTCGCAGCGAGAAGATACGGCACATGGACGAGCTGCACTCTGCACAGAAG
AAGGTCGCCGACCTGGAGAGCAGGTGA

Protein sequence:

MGSMRPNGRFLSFANNIQRTQKQPVPSGFPQSLSGSETDVSTSNENLSREERYVVRHTAR
VEPQGQENQSQTNNNNNNNRNTLKDNVGGSNRNSLKDSVGGGNSNRSSLDVSSSSYNTLI
IHNQDDSWSSRPTPIREHERTNSEVKHSNTQSSPYHTLKKSDAVKKPSGIPLPKVHKEQT
VTASANYIDIGGQRIYTSPPDQGVQEINEIPDDFLNQSSVLKHLAKEVTQSPTPRGLTPP
ASPHSTRAPSKPREERKGKGSKAKLSKEKLNLSRSQPDLTSVGVRAVPGGSESSGWCSGG
EGSLEEADDAFAAVLDALAAENHALKRQLADACERVAKTHKLEQEVEKVRTAHEELVGSC
ERRERLERAARVRLQADCRRLHEINRALKHQTELLSSGGRAEGGASVEALRKELQGREML
IAQLITQNKELACAKERQEIEMSAQRATLQEQRTHIDILDTALTNAQANVVRLEDECRHA
SGYVERVLGLQRALASLQQASDRREHTERKLRAQLETELQALRKRECVCGGVDTSGVSGG
GGGGGGGAACGGEAGAEAELRRALRSRDERLLALEGECAKWEQRYLEEAALRQAAVSAAS
IPKDAKIAALEKTSAESERLMAEARSEKIRHMDELHSAQKKVADLESR