DPGLEAN00002 in OGS1.0

New model in OGS2.0DPOGS204574 
Genomic Positionscaffold4897:+ 2433-8311
See gene structure
CDS Length1875
Paired RNAseq reads  340
Single RNAseq reads  830
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA001979 (4e-58)
Best Drosophila hit  CG6867 (2e-77)
Best Human hitneurotrimin isoform 2 (9e-13)
Best NR hit (blastp)  colmedin [Culex quinquefasciatus] (1e-123)
Best NR hit (blastx)  AGAP005849-PA [Anopheles gambiae str. PEST] (4e-112)
GeneOntology terms

  
GO:0005575 cellular_component
GO:0008150 biological_process
GO:0003674 molecular_function
InterPro families




  
IPR007110 Immunoglobulin-like
IPR013783 Immunoglobulin-like fold
IPR003599 Immunoglobulin subtype
IPR003598 Immunoglobulin subtype 2
IPR008160 Collagen triple helix repeat
IPR013098 Immunoglobulin I-set
Orthology groupMCL12323

Nucleotide sequence:

ATGACTTCGGACTTAATCAAGGATAATAAGAATAGAGGAGAAGTTGCTGCTACAGATGTG
CCGTGCTGCAAGAAAATTACTGTGTTTGCCTGTGTCTCTGGCGTTTTTTCATTAATTTTG
CATGCATATAGCTATGCAGAACTCTCGGCTATTAAAAGTCATGAAGAGTTACATTCTAGA
CACATAAATAAACTCATAGAAAATAGGATTCAAGAAAGATTTATAGAGTTAATGAGTACA
ACTAGCCCCCATAGATTAAAAAGAGATGCGATGCTTAAACAATCCCCCATCGAGGAGGAT
AATACGGTTGCTCCACACGTGGAATTTTTCAACCCTAAAATGAGACCAGAACTAGAAGAG
AAAGACTCCATAGAAATGAAAAGAACCGGTGCCAAGGGACCTGCCCCGGGAGACGACACT
TGGGTTTGGCTGACGAGCTACTCCAGGGTCCCATACAAAGTAGTTCAAGGGTTTTGCAAG
GCTACACAAGATTACTGTCCTCCTGGCGTCCAAGGACCAAAAGGTCCCATGGGTCACCCA
GGTCCAAAAGGAGACAGGGGGTCACCAGGGGAGGCTGGCATACCTGGTAGCCCAGGTTCA
GTAGGACCTTTCGGACCCCCTGGACCAAAAGGCGAACGTGGATTCCCGGGCAACCCTGGC
TTAGATGGTAGAGATGGAGTGCCGGGAGAACCAGGACTTGACGGCTTGCCGGGGCGGAAT
GGGGCCGACGGAGCCCCGGGTAGGTACGGACAAGACGGGATACCAGGCAGGGATGGAATC
CCAGGAAAAAATGGAAAGGATGGAAAAGATGGAAAAGTCGGAGCTCAGGGCCCACCTGGT
ATTCGAGGCCCTAAAGGCGAACGAGGTCCAATCGGCCCCAAAGGCCCGAAGGGAAATGAC
GGACTTAACGGAATACCCGGCAAGCCAGGACTATCCATCTATAACTACACCAAAGAAAAC
CAGATGTTCATTCCCCCTTCCTTTGCATTGGATAATCCGAGACTTATAGTAAGAGAGGGG
GATACTATGAGATTGGACTGCAATCCCAAAGGCTTCCCTGAACCCATTATTGAATGGAGG
AGAGCTGACGGCACACCCATTATTCAGGGTTCATGGCGTGACGCCTCCGTCAGTGGTCAC
GTGCTTAACATACCAAACGTATCTCGTTGGCACACCGGCAAGTATGTATGTCTCGCTAAC
AACGGCATGCAGCCTCCCGCTAACCAGACCACGGACGTTGAAGTCAATTTCAGCCCATAC
ATAAGGGTGCCAAACAACATAGTCTACGTATTCAACAAAACTGCCCAAATCGAGTGCGAG
ATTCAAGCCTGGCCGGAGCCAGTGCTGGCTTGGGAGTACGACGATGGAACAACAGTCGAG
GGATCACACTACAAGATTGAGGTGGCGCCAACACCGGATCCCTGGAGGTGGATCATGAAG
CTGGAGATACCTCACATCAATGAGCACGACATGCGCCAGTACATCTGCGTGGCCAAAAAT
GAACTCAATAACACAACCGTCAGAGGCTATATTAGACTGTCCCATCCTGGTCCGAAACAA
CAATCTCAGATACAACAACAACCACGCGAGTTCGGCTCCCCTCCGCCCACGTTGACCTCG
TACGAAGAACTGTGCTCCGCCCAACGCTGCCCATCCTGCCCACGATGTGATCGAGCGCTC
ATGATCACGCCCATGAACGCCAGCTTAGGCAACAAGCCTCACCGGAATACCAATTGTCAG
CTGTACGCGATCGGCAAACCAGTGTACCACAAGTACAAGGAGGAGTTGTTTGGTGCCTGG
CTAAGAGATTCGAATTCCTCTGAAGCTCAGGTAAACAAGGAAAAAGAGACCAATTATACT
ACCCCTAGCTCGTGA

Protein sequence:

MTSDLIKDNKNRGEVAATDVPCCKKITVFACVSGVFSLILHAYSYAELSAIKSHEELHSR
HINKLIENRIQERFIELMSTTSPHRLKRDAMLKQSPIEEDNTVAPHVEFFNPKMRPELEE
KDSIEMKRTGAKGPAPGDDTWVWLTSYSRVPYKVVQGFCKATQDYCPPGVQGPKGPMGHP
GPKGDRGSPGEAGIPGSPGSVGPFGPPGPKGERGFPGNPGLDGRDGVPGEPGLDGLPGRN
GADGAPGRYGQDGIPGRDGIPGKNGKDGKDGKVGAQGPPGIRGPKGERGPIGPKGPKGND
GLNGIPGKPGLSIYNYTKENQMFIPPSFALDNPRLIVREGDTMRLDCNPKGFPEPIIEWR
RADGTPIIQGSWRDASVSGHVLNIPNVSRWHTGKYVCLANNGMQPPANQTTDVEVNFSPY
IRVPNNIVYVFNKTAQIECEIQAWPEPVLAWEYDDGTTVEGSHYKIEVAPTPDPWRWIMK
LEIPHINEHDMRQYICVAKNELNNTTVRGYIRLSHPGPKQQSQIQQQPREFGSPPPTLTS
YEELCSAQRCPSCPRCDRALMITPMNASLGNKPHRNTNCQLYAIGKPVYHKYKEELFGAW
LRDSNSSEAQVNKEKETNYTTPSS