New model in OGS2.0 | DPOGS202937  |
---|---|
Genomic Position | scaffold301:+ 5427-12996 |
CDS Length | 1773 |
Paired RNAseq reads   | 730 |
Single RNAseq reads   | 1625 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA001906 (1e-121) |
Best Drosophila hit   | CG6232, isoform B (3e-44) |
Best Human hit | thrombospondin type-1 domain-containing protein 4 precursor (9e-47) |
Best NR hit (blastp)   | PREDICTED: similar to thrombospondin repeat protein 1 [Apis mellifera] (3e-59) |
Best NR hit (blastx)   | PREDICTED: similar to thrombospondin repeat protein 1 [Apis mellifera] (5e-66) |
GeneOntology terms    | GO:0004222 metalloendopeptidase activity; GO:0005576 extracellular region; GO:0005578 proteinaceous extracellular matrix; GO:0008233 peptidase activity |
InterPro families    | IPR000884 Thrombospondin, type 1 repeat; IPR010909 PLAC; IPR010294 ADAM-TS Spacer 1 |
Orthology group | MCL39227 |
Nucleotide sequence:
ATGGCGCCTCTATCGAGATTATTCATCATTTTAGTGATTACCGTGGTAGGTGGTGAGGTA
CTGTCGGTGGTAGGTAGTCGCGATGTTCGCTGCGGGCGGCGGCTGGTATCCGGTCTGTTC
GCACGGCCACGTCTACCACTCGGCTACTCCTATGTTACCACCGTCCCCTCTGGCGCCTGT
CGACTCAACGTCTCGGAGATACTCGCCAGCGATAATTACATCGCTTTGAAAATAACGAAT
GGTTCGTTTATAATGAACGGTGAATTCGCCGTCAGTAGTCCTGGTACATACGAGGCAGCT
GGTGCAAGATTCGTTTACAGCAGAAAAACAGGTCTAGATTCAGTGTATGCACTTGGACCT
ACCCACGATTCTATCGATATTATGATATTATACACTCAACCAAATCCAAGTATAAAATAC
GAATATTTCACCGAATCGTTGCCAGGTGAAGTTGAAACTGAATCATTGACAGTGTCCCCA
CCAGAACCAACTGTCGTACCCAAACATTCGAGACGTCATCACGGTATAGAATACGCTAAA
GCAGGAGCCCGGCATTTGGATCCAGGAGTCAAAGATAAAAACAACGTTGAGGAAAATGTC
GTAGCTGGAAGAAAATTTGTATGGAAGATACTTGCGTATACTCAGTGTTCTAGAAGCTGC
GGTGGTGGTATTCAGCTAGGAAAATACAGGTGCGTAGAAGTATCCTCTAGTGATTGGGAG
GTGTCCCCAGCACATTGTTTGGGTTCCCCTCCGTCAGGTAGACGTCGTCGTTGTGGAACC
ATTCCTTGCGCTCCGAGATGGCGGGCCGCTAGCTGGTCTCCATGCCCGTCCTGTGGACCA
GCGACAAAGAATAGGATCGTTGGATGTGTACAAGATCATTCAAGAGGAATTACTAAGGTA
AGCGATCAAAAATGTTTGGCTTCAAAACCGGCGACCACAGAAGATTGTAACATCCCAGAT
TGTAAAAACCCTGGAATACGGCACACAGAGGCGAAGCCCCAGGAACATACAGATGCCTTC
CACGATGGTTCAGTGTACACAGTCGATGTTAATACCACGGACACGGAACTTGGACCAGAA
TATAGTTTCACTTCCATTAGAGGATGGCTTTTCACCGATTGGTCTGAGTGTGTAGGATGG
TGTGTAGGCGGTGGTTTGAAGACCAGGTCTGTTCGATGTGGTGATCCCTCAGGTTGTGCA
GGACCCTCCCCGGAGACGTCTCAAGACTGTGTCCCTTCAGTGACATGTGAACCCCACGAT
GGCCGCTGGTTTGCAGGGGACTGGTCGAAGTGCTCGTCCCCTTGCGGGAAGCAGATCCGA
GTGGTGTTATGTATCGGAGGTACCGGAAGGCATCTGAGGGACTCCGCCTGTAGGGACCCT
CGGCCAGAACACGAGAGGAACTGCCCCGGAGAATGCCCAGCGACGTGGTATTACAGCGAA
TGGGGTCAGTGTACAGGTAACTGTAGTATTGGCCTGGGCGTCCAACGCCGATGGGTGTCG
TGTGTGAGGAATGATGTCACCGTCAGCGAAACTGAGTGTACGACACCACCACCGACACCA
CACAGATCCTGTATCCCGTCATGTATACCACCAGATCTCGTTATAGAGTCTCAAAAATCA
ACGAACGATCAATCGACAATGAAGCCGAGACCACAAACAGTTCCATCGGGGAAAGACTGC
GAGGACAAATTGACGAACTGCGCTCTAGCCGTACAGGCGCGACTGTGCCATTACAAATAC
TACATCCACAACTGCTGCGATTCTTGTAAATAA
Protein sequence:
MAPLSRLFIILVITVVGGEVLSVVGSRDVRCGRRLVSGLFARPRLPLGYSYVTTVPSGAC
RLNVSEILASDNYIALKITNGSFIMNGEFAVSSPGTYEAAGARFVYSRKTGLDSVYALGP
THDSIDIMILYTQPNPSIKYEYFTESLPGEVETESLTVSPPEPTVVPKHSRRHHGIEYAK
AGARHLDPGVKDKNNVEENVVAGRKFVWKILAYTQCSRSCGGGIQLGKYRCVEVSSSDWE
VSPAHCLGSPPSGRRRRCGTIPCAPRWRAASWSPCPSCGPATKNRIVGCVQDHSRGITKV
SDQKCLASKPATTEDCNIPDCKNPGIRHTEAKPQEHTDAFHDGSVYTVDVNTTDTELGPE
YSFTSIRGWLFTDWSECVGWCVGGGLKTRSVRCGDPSGCAGPSPETSQDCVPSVTCEPHD
GRWFAGDWSKCSSPCGKQIRVVLCIGGTGRHLRDSACRDPRPEHERNCPGECPATWYYSE
WGQCTGNCSIGLGVQRRWVSCVRNDVTVSETECTTPPPTPHRSCIPSCIPPDLVIESQKS
TNDQSTMKPRPQTVPSGKDCEDKLTNCALAVQARLCHYKYYIHNCCDSCK
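
The CDS length and the two sequences in this record can be cross-checked against each other. The sketch below is a minimal, illustrative check, not part of the database's own tooling: the function name `cds_summary` and the returned keys are hypothetical. It assumes the nucleotide sequence is a complete coding sequence that includes the terminal stop codon, so the expected protein length is the number of codons minus one.

```python
# Hypothetical helper for sanity-checking a CDS record; names are illustrative.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def cds_summary(cds: str) -> dict:
    """Summarize basic consistency properties of a coding sequence."""
    # Drop line breaks and spaces so a multi-line sequence dump can be pasted in.
    seq = "".join(cds.split()).upper()
    return {
        "length": len(seq),
        "in_frame": len(seq) % 3 == 0,
        "starts_with_atg": seq.startswith("ATG"),
        "ends_with_stop": seq[-3:] in STOP_CODONS,
        # Assumes the stop codon is included in the CDS and is not translated.
        "expected_protein_length": len(seq) // 3 - 1,
    }


# Toy input: the first three codons of the sequence above plus its stop codon.
print(cds_summary("ATGGCGCCTTAA"))
```

Applied to the full nucleotide sequence above (1773 nt, beginning with ATG and ending with TAA), the same arithmetic gives 1773 / 3 - 1 = 590 residues, which matches the length of the protein sequence listed.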