New model in OGS2.0 | DPOGS204574  |
---|---|
Genomic Position | scaffold4897:+ 2433-8311 |
See gene structure | |
CDS Length | 1875 |
Paired RNAseq reads   | 340 |
Single RNAseq reads   | 830 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA001979 (4e-58) |
Best Drosophila hit   | CG6867 (2e-77) |
Best Human hit | neurotrimin isoform 2 (9e-13) |
Best NR hit (blastp)   | colmedin [Culex quinquefasciatus] (1e-123) |
Best NR hit (blastx)   | AGAP005849-PA [Anopheles gambiae str. PEST] (4e-112) |
GeneOntology terms    | GO:0005575 cellular_component GO:0008150 biological_process GO:0003674 molecular_function |
InterPro families    | IPR007110 Immunoglobulin-like IPR013783 Immunoglobulin-like fold IPR003599 Immunoglobulin subtype IPR003598 Immunoglobulin subtype 2 IPR008160 Collagen triple helix repeat IPR013098 Immunoglobulin I-set |
Orthology group | MCL12323 |
Nucleotide sequence:
ATGACTTCGGACTTAATCAAGGATAATAAGAATAGAGGAGAAGTTGCTGCTACAGATGTG
CCGTGCTGCAAGAAAATTACTGTGTTTGCCTGTGTCTCTGGCGTTTTTTCATTAATTTTG
CATGCATATAGCTATGCAGAACTCTCGGCTATTAAAAGTCATGAAGAGTTACATTCTAGA
CACATAAATAAACTCATAGAAAATAGGATTCAAGAAAGATTTATAGAGTTAATGAGTACA
ACTAGCCCCCATAGATTAAAAAGAGATGCGATGCTTAAACAATCCCCCATCGAGGAGGAT
AATACGGTTGCTCCACACGTGGAATTTTTCAACCCTAAAATGAGACCAGAACTAGAAGAG
AAAGACTCCATAGAAATGAAAAGAACCGGTGCCAAGGGACCTGCCCCGGGAGACGACACT
TGGGTTTGGCTGACGAGCTACTCCAGGGTCCCATACAAAGTAGTTCAAGGGTTTTGCAAG
GCTACACAAGATTACTGTCCTCCTGGCGTCCAAGGACCAAAAGGTCCCATGGGTCACCCA
GGTCCAAAAGGAGACAGGGGGTCACCAGGGGAGGCTGGCATACCTGGTAGCCCAGGTTCA
GTAGGACCTTTCGGACCCCCTGGACCAAAAGGCGAACGTGGATTCCCGGGCAACCCTGGC
TTAGATGGTAGAGATGGAGTGCCGGGAGAACCAGGACTTGACGGCTTGCCGGGGCGGAAT
GGGGCCGACGGAGCCCCGGGTAGGTACGGACAAGACGGGATACCAGGCAGGGATGGAATC
CCAGGAAAAAATGGAAAGGATGGAAAAGATGGAAAAGTCGGAGCTCAGGGCCCACCTGGT
ATTCGAGGCCCTAAAGGCGAACGAGGTCCAATCGGCCCCAAAGGCCCGAAGGGAAATGAC
GGACTTAACGGAATACCCGGCAAGCCAGGACTATCCATCTATAACTACACCAAAGAAAAC
CAGATGTTCATTCCCCCTTCCTTTGCATTGGATAATCCGAGACTTATAGTAAGAGAGGGG
GATACTATGAGATTGGACTGCAATCCCAAAGGCTTCCCTGAACCCATTATTGAATGGAGG
AGAGCTGACGGCACACCCATTATTCAGGGTTCATGGCGTGACGCCTCCGTCAGTGGTCAC
GTGCTTAACATACCAAACGTATCTCGTTGGCACACCGGCAAGTATGTATGTCTCGCTAAC
AACGGCATGCAGCCTCCCGCTAACCAGACCACGGACGTTGAAGTCAATTTCAGCCCATAC
ATAAGGGTGCCAAACAACATAGTCTACGTATTCAACAAAACTGCCCAAATCGAGTGCGAG
ATTCAAGCCTGGCCGGAGCCAGTGCTGGCTTGGGAGTACGACGATGGAACAACAGTCGAG
GGATCACACTACAAGATTGAGGTGGCGCCAACACCGGATCCCTGGAGGTGGATCATGAAG
CTGGAGATACCTCACATCAATGAGCACGACATGCGCCAGTACATCTGCGTGGCCAAAAAT
GAACTCAATAACACAACCGTCAGAGGCTATATTAGACTGTCCCATCCTGGTCCGAAACAA
CAATCTCAGATACAACAACAACCACGCGAGTTCGGCTCCCCTCCGCCCACGTTGACCTCG
TACGAAGAACTGTGCTCCGCCCAACGCTGCCCATCCTGCCCACGATGTGATCGAGCGCTC
ATGATCACGCCCATGAACGCCAGCTTAGGCAACAAGCCTCACCGGAATACCAATTGTCAG
CTGTACGCGATCGGCAAACCAGTGTACCACAAGTACAAGGAGGAGTTGTTTGGTGCCTGG
CTAAGAGATTCGAATTCCTCTGAAGCTCAGGTAAACAAGGAAAAAGAGACCAATTATACT
ACCCCTAGCTCGTGA
Protein sequence:
MTSDLIKDNKNRGEVAATDVPCCKKITVFACVSGVFSLILHAYSYAELSAIKSHEELHSR
HINKLIENRIQERFIELMSTTSPHRLKRDAMLKQSPIEEDNTVAPHVEFFNPKMRPELEE
KDSIEMKRTGAKGPAPGDDTWVWLTSYSRVPYKVVQGFCKATQDYCPPGVQGPKGPMGHP
GPKGDRGSPGEAGIPGSPGSVGPFGPPGPKGERGFPGNPGLDGRDGVPGEPGLDGLPGRN
GADGAPGRYGQDGIPGRDGIPGKNGKDGKDGKVGAQGPPGIRGPKGERGPIGPKGPKGND
GLNGIPGKPGLSIYNYTKENQMFIPPSFALDNPRLIVREGDTMRLDCNPKGFPEPIIEWR
RADGTPIIQGSWRDASVSGHVLNIPNVSRWHTGKYVCLANNGMQPPANQTTDVEVNFSPY
IRVPNNIVYVFNKTAQIECEIQAWPEPVLAWEYDDGTTVEGSHYKIEVAPTPDPWRWIMK
LEIPHINEHDMRQYICVAKNELNNTTVRGYIRLSHPGPKQQSQIQQQPREFGSPPPTLTS
YEELCSAQRCPSCPRCDRALMITPMNASLGNKPHRNTNCQLYAIGKPVYHKYKEELFGAW
LRDSNSSEAQVNKEKETNYTTPSS