DPGLEAN05149 in OGS1.0

New model in OGS2.0DPOGS212543 
Genomic Positionscaffold4161:- 1563-7693
See gene structure
CDS Length1197
Paired RNAseq reads  168
Single RNAseq reads  513
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA008132 (4e-101)
Best Drosophila hit  klingon (1e-37)
Best Human hitneurotrimin isoform 3 (1e-10)
Best NR hit (blastp)  PREDICTED: similar to lachesin, putative [Tribolium castaneum] (5e-76)
Best NR hit (blastx)  hypothetical protein TcasGA2_TC003806 [Tribolium castaneum] (6e-75)
GeneOntology terms






  
GO:0007156 homophilic cell adhesion
GO:0019897 extrinsic to plasma membrane
GO:0007465 R7 cell fate commitment
GO:0045466 R7 cell differentiation
GO:0007611 learning or memory
GO:0008355 olfactory learning
GO:0007616 long-term memory
GO:0007615 anesthesia-resistant memory
InterPro families




  
IPR007110 Immunoglobulin-like
IPR003961 Fibronectin, type III
IPR003598 Immunoglobulin subtype 2
IPR008957 Fibronectin type III domain
IPR013783 Immunoglobulin-like fold
IPR013098 Immunoglobulin I-set
Orthology groupMCL16031

Nucleotide sequence:

ATGTTCAACCCGTATCTAGAGCCATTACGTGCGCTGGTTATTCGTCCAGAGCCCTTGGGT
CCCGTCAGACAGACACACGCCATCAAAGTTCAATATGCTCCAATAGTGAGGACTATACCT
GAAGAAGGTTATTTGGAAGTCAAGAAAGGTGAATACGTTGACATTGGTTGTGAGGCGACT
GGGACACCTACTCCTATAGTCAATTGGAAGAAGAATGGAGAGTCCATGGCGCTACTGGAA
CACAGGTCCAGGATTCGGTTCCGCGCTGAACACCGTCTTCTAGCTGGGGTGTACGAGTGT
ACAGCAACCAATGGCGTCGGCGACCCCATGACAGCGGCAATAACAGTTATAATACAAGAC
GCTCCAGTAGTAACCACATCTCGTAGTTTCGTTCATACGGCTATAGGGCTGAGAGCGGTG
CTGGCATCCAAGCTAGAGTTTGCAGCACCCCCAGCTCGCACGGCCTGGTACAGAGATGGA
AAACCAGTTCGGACAGACGACAGAATTATAATAATGGTCAAGGACAATGTCCATCAGTTA
ATATTTAGGAGCGTCCGGAAATCTGATTTCGGTAACTACACCTTCAGAGCTGAGAATAGT
CTTGGTATGGCCGATGTTTCGTTCAAATTGACGGGTGTTCCAAATACCGCGTCATTTAAA
GTGGATCCCTCTCTAAACAAAGCAGATGCAACAAGTTACACACTGCTGTGGGAAGTCGAT
AGCTACTCCAATATCATAGAGTATAATCTTTGGCTTCGTCCATACTACGGTCGTCCCGCT
ACCACGGAATCGGACTTCATAACGACCGAGACTCCAAACGTCTGGTCAAAGATCGTGGTA
CCGGGAGACTCTAATGAAGGTCCAATACACAGCGCCGCTTATTCCGTCAGAGGCTTAACT
CCGTCTACCGTCTATGAGGCGGTGGTCACGTCTAGAAATAGATTTGGATGGAGTAAGCCT
TCCGCTGTTCTACATTTCGCTACAGAGCCTGGAGCCGGAAAAATTTTACTCTCGACATCG
GATTACACCGATTTCACTCCAATCTTAGAAGATCCTGAACCACAGCAACTTTACAATATA
ACACAGGCGCAAGTTTTCGAACGTTTCAACGAACTGTCAAATTCATCGAGGCGGGAAAAA
ATATCATTAACCTGTATGTTTACTTTCACGTTTATATTATTTAAATACTTTTGTTGA

Protein sequence:

MFNPYLEPLRALVIRPEPLGPVRQTHAIKVQYAPIVRTIPEEGYLEVKKGEYVDIGCEAT
GTPTPIVNWKKNGESMALLEHRSRIRFRAEHRLLAGVYECTATNGVGDPMTAAITVIIQD
APVVTTSRSFVHTAIGLRAVLASKLEFAAPPARTAWYRDGKPVRTDDRIIIMVKDNVHQL
IFRSVRKSDFGNYTFRAENSLGMADVSFKLTGVPNTASFKVDPSLNKADATSYTLLWEVD
SYSNIIEYNLWLRPYYGRPATTESDFITTETPNVWSKIVVPGDSNEGPIHSAAYSVRGLT
PSTVYEAVVTSRNRFGWSKPSAVLHFATEPGAGKILLSTSDYTDFTPILEDPEPQQLYNI
TQAQVFERFNELSNSSRREKISLTCMFTFTFILFKYFC