DPGLEAN17135 in OGS1.0

New model in OGS2.0DPOGS204574 
Genomic Positionscaffold266:- 26001-33956
See gene structure
CDS Length2169
Paired RNAseq reads  440
Single RNAseq reads  1126
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA001979 (0.0)
Best Drosophila hit  CG6867 (6e-117)
Best Human hitolfactomedin-like protein 2B precursor (6e-27)
Best NR hit (blastp)  colmedin [Culex quinquefasciatus] (0.0)
Best NR hit (blastx)  AGAP005849-PA [Anopheles gambiae str. PEST] (2e-177)
GeneOntology terms


  
GO:0005576 extracellular region
GO:0031012 extracellular matrix
GO:0050840 extracellular matrix binding
GO:0030198 extracellular matrix organization
InterPro families





  
IPR007110 Immunoglobulin-like
IPR003112 Olfactomedin-like
IPR013783 Immunoglobulin-like fold
IPR008160 Collagen triple helix repeat
IPR013098 Immunoglobulin I-set
IPR003599 Immunoglobulin subtype
IPR003598 Immunoglobulin subtype 2
Orthology groupMCL12323

Nucleotide sequence:

ATGAGACCAGAACTAGAAGAGAAAGACTCCATAGAAATGAAAAGAACCGGTGCCAAGGGA
CCTGCCCCGGGAGACGACACTTGGGTTTGGCTGACGAGCTACTCCAGGGTCCCATACAAA
GTAGTTCAGGGGTTTTGCAAGGCTACTCAGGATTACTGTCCTCCTGGCGTCCAAGGACCA
AAAGGTCCCATGGGTCACCCAGGTCCAAAAGGAGACAGGGGGTCACCAGGGGAGGCTGGC
ATACCTGGTAGCCCAGGTTCAGTAGGACCTTTCGGACCCCCTGGACCAAAAGGCGAACGT
GGATTTCCGGGGAATCCTGGCTTAGATGGTAGAGATGGAGTGCCAGGAGAACCAGGACTT
GATGGCTTGCCGGGGCGGAATGGGGCAGACGGAGCCCCGGGTAGGTACGGACAAGACGGG
ATACCAGGCAGGGATGGAATCCCAGGAAAAAATGGAAAGGATGGAAAAGATGGAAAAGTC
GGAGCTCAGGGCCCACCTGGTATTCGAGGCCCTAAAGGCGAACGAGGTCCAATCGGCCCC
AAAGGCCCGAAGGGAAATGACGGACTTAACGGAATACCCGGCAAGCCAGGACTATCCATC
TATAACTACACCAAAGAAAACCAGATGTTCATTCCCCCTTCCTTTGCATTGGATAATCCG
AGACTTATAGTAAGAGAGGGGGATACTATGAGATTGGACTGCAATCCCAAAGGCTTCCCT
GAACCCATCATTGAATGGAGGAGAGCTGACGGCACACCCATTATTCAGGGTTCATGGCGT
GACGCCTCCGTCAGTGGTCACGTGCTTAACATACCAAACGTATCTCGTTGGCACACCGGC
AAGTATGTGTGTCTCGCTAACAACGGCATGCAGCCTCCCGCCAACCAGACCACGGATGTT
GAAGTTAATTTCAGCCCATACATCAGGGTGCCAAACAACATAGTCTACGTATTCAACAAA
ACTGCCCAAATCGAGTGCGAGATTCAAGCCTGGCCGGAGCCAGTGCTGGCTTGGGAGTAC
GACGATGGAACAACAGTCGAGGGATCACACTACAAGATTGAGGTGGCGCCAACACCGGAT
CCCTGGAGGTGGATCATGAAGCTGGAGATACCTCACATCAATGAGCACGACATGCGCCAG
TACATCTGCGTGGCCAAAAATGAACTCAATAACACAACCGTCAGAGGCTATATTAGACTG
TCCCATCCTGGTCCGAAACAACAATCTCAGATACAACAACAACCACGCGAGTTCGGCTCC
CCGCCGCCCACGTTGACCTCGTACGAAGAACTGTGCTCCGCCCAACGCTGCCCATCCTGC
CCACGATGTGATCGAGCGCTCATGATCACGCCCATGAACGCCAGCTTAGGCAACAAGCCT
CACCGGAATACCAATTGTCAGCTGTACGCGATCGGCAAACCAGTGTACCACAAGTACAAG
GAGGAGTTGTTTGGTGCCTGGCTAAGAGATTCGAATTCCTCTGAAGCTCAGCGAGAGAAG
CTGTGGACTACCCAGGAGAACGACGTGGAGAGATTGCACGAATTCCGGAGTAAGGCAAGC
TTCAAGTCGGATAGAGTAGACGAGTTCCACAAACTCCAGAAACCTTTCTTTGGTAATGGT
CACATAGTGTACAGCGGCTCTTTCTTCTATCAAGCCAACGAGTCCGGTACACCCGGCGAC
ATTGTGCGCTACGACCTGACACAAAGCCGTATCAAATCAGCACATCTACCGCACGCGCAG
GGCAGACTGTACACGGCACAACACAACCAAGTCGACTTCAGCGCCGACGACAACGGCCTC
TGGGCGATTTACTCCATAGAAGGTTCGAATAACACAGCAGTTGCTAAGCTGAGCTTTGAT
CCCAACAAGGATGATCTTAATATAGACTATATCTGGAACATCTCCTTAAATCATAAACAA
GTAGGTGAAATGTTCATAGTTTGCGGCGTCCTCTATGCGTTGGATTCCGCAACAGAACGC
GACAGCAAAGTATCGATCGCCATTGACTTGTACCTTAGCAAGTCGATCGATGTCACACTG
CAGTTCACGAATCCATTCAGAAAAACAACACAATTAGGCTACGATCACACGCATAAGGAA
CTATATTCCTGGGATAGGGGTAATCAGCTGACATATCCAGTCCGGTACAACGAACTTCCG
GGCCCCTAA

Protein sequence:

MRPELEEKDSIEMKRTGAKGPAPGDDTWVWLTSYSRVPYKVVQGFCKATQDYCPPGVQGP
KGPMGHPGPKGDRGSPGEAGIPGSPGSVGPFGPPGPKGERGFPGNPGLDGRDGVPGEPGL
DGLPGRNGADGAPGRYGQDGIPGRDGIPGKNGKDGKDGKVGAQGPPGIRGPKGERGPIGP
KGPKGNDGLNGIPGKPGLSIYNYTKENQMFIPPSFALDNPRLIVREGDTMRLDCNPKGFP
EPIIEWRRADGTPIIQGSWRDASVSGHVLNIPNVSRWHTGKYVCLANNGMQPPANQTTDV
EVNFSPYIRVPNNIVYVFNKTAQIECEIQAWPEPVLAWEYDDGTTVEGSHYKIEVAPTPD
PWRWIMKLEIPHINEHDMRQYICVAKNELNNTTVRGYIRLSHPGPKQQSQIQQQPREFGS
PPPTLTSYEELCSAQRCPSCPRCDRALMITPMNASLGNKPHRNTNCQLYAIGKPVYHKYK
EELFGAWLRDSNSSEAQREKLWTTQENDVERLHEFRSKASFKSDRVDEFHKLQKPFFGNG
HIVYSGSFFYQANESGTPGDIVRYDLTQSRIKSAHLPHAQGRLYTAQHNQVDFSADDNGL
WAIYSIEGSNNTAVAKLSFDPNKDDLNIDYIWNISLNHKQVGEMFIVCGVLYALDSATER
DSKVSIAIDLYLSKSIDVTLQFTNPFRKTTQLGYDHTHKELYSWDRGNQLTYPVRYNELP
GP