DPGLEAN17021 in OGS1.0

New model in OGS2.0DPOGS202937 
Genomic Positionscaffold301:+ 5427-12996
See gene structure
CDS Length1773
Paired RNAseq reads  730
Single RNAseq reads  1625
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA001906 (1e-121)
Best Drosophila hit  CG6232, isoform B (3e-44)
Best Human hitthrombospondin type-1 domain-containing protein 4 precursor (9e-47)
Best NR hit (blastp)  PREDICTED: similar to thrombospondin repeat protein 1 [Apis mellifera] (3e-59)
Best NR hit (blastx)  PREDICTED: similar to thrombospondin repeat protein 1 [Apis mellifera] (5e-66)
GeneOntology terms


  
GO:0004222 metalloendopeptidase activity
GO:0005576 extracellular region
GO:0005578 proteinaceous extracellular matrix
GO:0008233 peptidase activity
InterPro families

  
IPR000884 Thrombospondin, type 1 repeat
IPR010909 PLAC
IPR010294 ADAM-TS Spacer 1
Orthology groupMCL39227

Nucleotide sequence:

ATGGCGCCTCTATCGAGATTATTCATCATTTTAGTGATTACCGTGGTAGGTGGTGAGGTA
CTGTCGGTGGTAGGTAGTCGCGATGTTCGCTGCGGGCGGCGGCTGGTATCCGGTCTGTTC
GCACGGCCACGTCTACCACTCGGCTACTCCTATGTTACCACCGTCCCCTCTGGCGCCTGT
CGACTCAACGTCTCGGAGATACTCGCCAGCGATAATTACATCGCTTTGAAAATAACGAAT
GGTTCGTTTATAATGAACGGTGAATTCGCCGTCAGTAGTCCTGGTACATACGAGGCAGCT
GGTGCAAGATTCGTTTACAGCAGAAAAACAGGTCTAGATTCAGTGTATGCACTTGGACCT
ACCCACGATTCTATCGATATTATGATATTATACACTCAACCAAATCCAAGTATAAAATAC
GAATATTTCACCGAATCGTTGCCAGGTGAAGTTGAAACTGAATCATTGACAGTGTCCCCA
CCAGAACCAACTGTCGTACCCAAACATTCGAGACGTCATCACGGTATAGAATACGCTAAA
GCAGGAGCCCGGCATTTGGATCCAGGAGTCAAAGATAAAAACAACGTTGAGGAAAATGTC
GTAGCTGGAAGAAAATTTGTATGGAAGATACTTGCGTATACTCAGTGTTCTAGAAGCTGC
GGTGGTGGTATTCAGCTAGGAAAATACAGGTGCGTAGAAGTATCCTCTAGTGATTGGGAG
GTGTCCCCAGCACATTGTTTGGGTTCCCCTCCGTCAGGTAGACGTCGTCGTTGTGGAACC
ATTCCTTGCGCTCCGAGATGGCGGGCCGCTAGCTGGTCTCCATGCCCGTCCTGTGGACCA
GCGACAAAGAATAGGATCGTTGGATGTGTACAAGATCATTCAAGAGGAATTACTAAGGTA
AGCGATCAAAAATGTTTGGCTTCAAAACCGGCGACCACAGAAGATTGTAACATCCCAGAT
TGTAAAAACCCTGGAATACGGCACACAGAGGCGAAGCCCCAGGAACATACAGATGCCTTC
CACGATGGTTCAGTGTACACAGTCGATGTTAATACCACGGACACGGAACTTGGACCAGAA
TATAGTTTCACTTCCATTAGAGGATGGCTTTTCACCGATTGGTCTGAGTGTGTAGGATGG
TGTGTAGGCGGTGGTTTGAAGACCAGGTCTGTTCGATGTGGTGATCCCTCAGGTTGTGCA
GGACCCTCCCCGGAGACGTCTCAAGACTGTGTCCCTTCAGTGACATGTGAACCCCACGAT
GGCCGCTGGTTTGCAGGGGACTGGTCGAAGTGCTCGTCCCCTTGCGGGAAGCAGATCCGA
GTGGTGTTATGTATCGGAGGTACCGGAAGGCATCTGAGGGACTCCGCCTGTAGGGACCCT
CGGCCAGAACACGAGAGGAACTGCCCCGGAGAATGCCCAGCGACGTGGTATTACAGCGAA
TGGGGTCAGTGTACAGGTAACTGTAGTATTGGCCTGGGCGTCCAACGCCGATGGGTGTCG
TGTGTGAGGAATGATGTCACCGTCAGCGAAACTGAGTGTACGACACCACCACCGACACCA
CACAGATCCTGTATCCCGTCATGTATACCACCAGATCTCGTTATAGAGTCTCAAAAATCA
ACGAACGATCAATCGACAATGAAGCCGAGACCACAAACAGTTCCATCGGGGAAAGACTGC
GAGGACAAATTGACGAACTGCGCTCTAGCCGTACAGGCGCGACTGTGCCATTACAAATAC
TACATCCACAACTGCTGCGATTCTTGTAAATAA

Protein sequence:

MAPLSRLFIILVITVVGGEVLSVVGSRDVRCGRRLVSGLFARPRLPLGYSYVTTVPSGAC
RLNVSEILASDNYIALKITNGSFIMNGEFAVSSPGTYEAAGARFVYSRKTGLDSVYALGP
THDSIDIMILYTQPNPSIKYEYFTESLPGEVETESLTVSPPEPTVVPKHSRRHHGIEYAK
AGARHLDPGVKDKNNVEENVVAGRKFVWKILAYTQCSRSCGGGIQLGKYRCVEVSSSDWE
VSPAHCLGSPPSGRRRRCGTIPCAPRWRAASWSPCPSCGPATKNRIVGCVQDHSRGITKV
SDQKCLASKPATTEDCNIPDCKNPGIRHTEAKPQEHTDAFHDGSVYTVDVNTTDTELGPE
YSFTSIRGWLFTDWSECVGWCVGGGLKTRSVRCGDPSGCAGPSPETSQDCVPSVTCEPHD
GRWFAGDWSKCSSPCGKQIRVVLCIGGTGRHLRDSACRDPRPEHERNCPGECPATWYYSE
WGQCTGNCSIGLGVQRRWVSCVRNDVTVSETECTTPPPTPHRSCIPSCIPPDLVIESQKS
TNDQSTMKPRPQTVPSGKDCEDKLTNCALAVQARLCHYKYYIHNCCDSCK