DPGLEAN08191 in OGS1.0

New model in OGS2.0DPOGS211046 
Genomic Positionscaffold40:+ 132848-154305
See gene structure
CDS Length1479
Paired RNAseq reads  137
Single RNAseq reads  633
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA003797 (5e-55)
Best Drosophila hit  CG42313 (5e-55)
Best Human hitnephrin precursor (7e-06)
Best NR hit (blastp)  hypothetical protein TcasGA2_TC016379 [Tribolium castaneum] (6e-79)
Best NR hit (blastx)  GD21979 [Drosophila simulans] (5e-57)
GeneOntology terms

  
GO:0003674 molecular_function
GO:0005575 cellular_component
GO:0008150 biological_process
InterPro families





  
IPR007110 Immunoglobulin-like
IPR003961 Fibronectin, type III
IPR013151 Immunoglobulin
IPR008957 Fibronectin type III domain
IPR013783 Immunoglobulin-like fold
IPR003599 Immunoglobulin subtype
IPR003598 Immunoglobulin subtype 2
Orthology groupMCL18968

Nucleotide sequence:

ATGCTCAGTGATTTAATATCCCATGAATCACGATCACTCGCCGCCGTAACAAACAACTTC
GTCTATTTGAAATTTGTTTCATGTCACGTGACGTGGCTCGAGGAGCTCCGCGATAACTTA
ACGATAAGTGTGATGAGCTTCGTGCCGACTCTAGACGACGATGGTAAACCCATCACGTGT
CGCGCTGAGAACCCCAACGTCACGTCACTGTTCATGGAGACGACCTGGACTATTAACGTT
GTTTATCCTCCTGTAGTTAAACTAAGACTCGGCAGTTCACTAGCAGCTGGCGATATTAAG
GAGGGTGATGATGTCTATTTCGAATGTCACGTGAGAGCTAACCCGCCCGCAAGGAAACTG
TCCTGGCTGCATGATGCAGCCCGAGTAAGGTCTTTTCAAATTTGGTTAGGTTGGAAGAGG
AGCTTGGACGGTGGAAGTGTGTTAGATAGGGTCCCTTTAAACCTACACCTGGGTCGCAAG
TCCCAGGCACCGTTAAGGCTTCTCCCGTTAAGGCTTCTCGCCCTGCAACGCGATGGACCG
CAACGCGATGGACCTGGCACCCGCACGGATAGGCAGCTGGCACACAACGCCACCGCTCGC
GTCTTCCACAGCAACCAGAGCCTCGTGCTGCAGAAAGTAACGAGGCACAGCAGCGGACGG
TACGCTTGTTCCGCACTCAACGCCGAGGGAGAGACTGTCTCTAACGAACTGCACTTCCGA
GTCAAATTTTCACTATCTTCGACCTCACCCGCCGGCCTCACCTCAATCGGAGACTCGTTT
CAATCTACGGAAGCGATGGCCATTATAGAGATAGTTAAATTGCTAGATTCGAATACGATA
ATGCTTCCTCATTTTCCCAAACCGATGGAACGACTATTGACACATGCGCCCTCGTGTCGC
AGCGGCGGTGTGTCAGTGGTCGGCGCGGCGCGAGGCGAGTCCGTGGTCATCGTGTGCGAG
GTGGACGCGGACCCCCCTGCAGCAGTTTTTAAGTGGAAGTTCAACAACTCCGGCGAGACT
CTGGATGTGGCCGCCGACAGATACACCTCCAACGGCAGTGCTTCTAGTTTAAAATATACA
CCAGTAGCGGATTTAGACTACGGCACGCTCTCTTGCGCTGCATCCAATGAAGTGGGAGTC
CAGGTGGCTCCCTGTGTCTTTCAAATGGTCGCCGCTGGGAAGCCACACGCACCTCGTAAC
TGCACCTTATGGAACCAGACGGCCGATTCAGCTGAGGTGTCTTGTGTTTCGGGTTTTGAC
GGAGGACTACCGCAGCACTTTTTACTTGAGGTGTACTCCGGGAACGAAGATAAGCCCAGA
GTGAACCTCACAGCCGAGGAACCTGTTTGGACGGTGCGAGGGCTGGAGTGGGACGTGCGA
TTCAGGCTGGTGGCTGTAGCCGTCAACAGCAAGGGCCGCTCGGCGCCAGCGCGGCTCGAT
GATCTTCTGTTCCCCGACCCGGAGAAGAGAACCGGTTAG

Protein sequence:

MLSDLISHESRSLAAVTNNFVYLKFVSCHVTWLEELRDNLTISVMSFVPTLDDDGKPITC
RAENPNVTSLFMETTWTINVVYPPVVKLRLGSSLAAGDIKEGDDVYFECHVRANPPARKL
SWLHDAARVRSFQIWLGWKRSLDGGSVLDRVPLNLHLGRKSQAPLRLLPLRLLALQRDGP
QRDGPGTRTDRQLAHNATARVFHSNQSLVLQKVTRHSSGRYACSALNAEGETVSNELHFR
VKFSLSSTSPAGLTSIGDSFQSTEAMAIIEIVKLLDSNTIMLPHFPKPMERLLTHAPSCR
SGGVSVVGAARGESVVIVCEVDADPPAAVFKWKFNNSGETLDVAADRYTSNGSASSLKYT
PVADLDYGTLSCAASNEVGVQVAPCVFQMVAAGKPHAPRNCTLWNQTADSAEVSCVSGFD
GGLPQHFLLEVYSGNEDKPRVNLTAEEPVWTVRGLEWDVRFRLVAVAVNSKGRSAPARLD
DLLFPDPEKRTG