New model in OGS2.0 | DPOGS200340  |
---|---|
Genomic Position | scaffold640:- 100226-130887 |
CDS Length | 3756 |
Paired RNAseq reads   | 1007 |
Single RNAseq reads   | 2290 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA005633 (1e-07) |
Best Drosophila hit   | ND |
Best Human hit | thrombospondin type-1 domain-containing protein 7B (8e-54) |
Best NR hit (blastp)   | HM00052 [Heliconius melpomene] (0.0) |
Best NR hit (blastx)   | HM00052 [Heliconius melpomene] (7e-152) |
GeneOntology terms    | GO:0016021 integral to membrane; GO:0016020 membrane |
InterPro families   | IPR000884 Thrombospondin, type 1 repeat |
Orthology group | MCL19291 |
Nucleotide sequence:
ATGCATGCATTAGTGATATTGCTGCTTGCAGCGGCAGTTGCAGCTGATGAAGTTACCGAC
CCAAACAGCCTTCTAGATCCGGCTCCAGAGCCGGTGGTCATTGAAGGGGAGTACACGGTG
TATGTGGGCACATGGACAGAATGTAGTACTGGCGACGGAGATCAAGCTAAGTCTTTCGAC
AGATATAACGCTCGTCTGGAATCCGATTCTCTGGTGCACACTCCACGTCTGGGACTTCAG
AGACGTCAAGTGCAATGTCGGAGAAAAGATGGCAGATTTGTAGAAGCTCTGTACTGCGGG
ATTGCCATAGCAAACATCGGCACAACTCGCGTGTGCGTGATGCGTGAGGACTGTTCCTTG
GCTGAATGGTTGCCATGGCGACCAAGATCTGATGGAGCTCTAGTAAGAACAAGACGTTTA
CGAAGACTGTCACAAGGCGGAGGCAAAGAATGCGACGTGGTCGAAGAAGTACGACCTACT
GTTTTGGAGACTACGGCACATTGGACTCCGGGTCCTTGGGGACCATGCCGTGTTGCTGTG
GAACAAGCGGCCACTGCTGCACCAACTGATGATGACGATGACGCGGATGACAATGACGCG
ATTTACGACGAGGATGACGAGAGTGACGAAAGCGAGCAGTCAAGCTGTGGGGGCGGAGTG
CAGCGACGTGCTGCGACCTGCGTCCGGGCAGACGGGCGAGCGTTGCACGACGCGCAGTGT
GCTCATGCAGTTATGCCGACACTCGTGCAACCTTGCGAGGTACCTTGCCCACGTGATTGT
GAAGTTGGAGAATGGAGTGAATGGGGTGCCTGTCAGCCTACTGACGGATGTCCTCTTTAC
CCAGTACAACAACTCACAACTACTGGGTACAGCGTACGTCGTCGAAGAGTAACAGCAGCA
GCATCCGGTGGAGGGGCGCCATGTCCTCCTCTAGAGGAAAAACGTACTTGCACTACACCA
AGATGTGCTGCGTGGAAAGCACTGCCGTGGGGCCCCTGTGTATTGACTCAACCCCATACT
AGTTGTGGACCAGGCCGACGTACCAGAGAACTTAGATGCATGGGACACGATGGAAAGGAA
GCTCAACGAGCGTGGTGTAACACTGGCGCTCCACCACGCAGTGAACGATGTCGTATCGCT
TGTCCGGGAGACTGTGTAGTGTCTGCATGGGCGGAGTGGTCACCTTGTTCAGCGAGTTGC
GTCGCCCCCGGACATGCACGACCCACGCGTACTAGACGCAGACATATACTTGCCCACGCT
GCACCAAATGGCTGGCCCTGCCCATCTGAAGACCAACTGATTCAAAACGAGACATGCAAC
ACCCACGCTTGTGCCACTTATTCCTGGCTTGCAACGCCTTGGGGACCTTGTGAGCGTCGC
CGGCAAGATTTCATACCGGCTACCAATTACACAGATCTTCTGGATAATGAACCGTTCAAT
GAAAGCGATGACGAAGAACCTTGTATTGAAGAAGGTGAAATGAGCAGAGACGTTATGTGT
GTTCAGAATAATGCTGACGTTGTCAGAGAAGCTCTATGTGCTCCATTACGTCGCCCAGCG
TCCCGTCGGGCGTGTACTGTAAGATGTCGACGAGGTTGCAGGGTTGAAGCTTGGATGCCC
TGGTCTCCATGTCCTGATACTTGTGACCCTGGTAAGCAGGTCCGCGTTCGCACCGTCCGA
GGTGGTCCGAACTGCGGTCCGTTGCAGGAGACACGCGACTGTCCCGTTTCGAGGTCGTGT
CGTTCCCGTGAGGCTGTCTGGGTCGCCGGGGAGTGGAGTACCTGTAGATTACCGCCAGGA
CAACGCTGTGGAGTTGGCTATAGGATTAGAAGTATCTGGTGCGGCTCGGACTCTCACCGC
GTCGAGGCCGGCGCATGTGCTGGTGCCCGGGTGCCGCCCGCTGCAGCAGCCTGCAGCGTC
ACATGCGACACCATCGTACCACTCACTTGTGATATCATATGTTCAGATCCCCTAAAATAC
TTGGATGCCTCTGACCCCGACGTACCCTCATGCGTCTGCAAGAATGTCTCATTGGAACTG
TTACCCGCTGATTCAGACTGTATTCTTCCACCTGGAATTGAATGCGGTGAAGGGAGATCA
CTGCGGGCAGCTCGTTGTTTAGTTGGAAGACGTGATGTACCCATGGATGTTTGTAGGAAA
TACCATCCCCTTACAGGACCCCGTCGCGTTCGTGAAGCAGCAACAGACGGCTTCACATAT
GATGAGGAATTCACATCTTTATTACGCGGTGCATGTAGCGTGCGGTGTGCGAGGGACTGT
GCGGTCGGGGCGTGGGCTGCCTGGGGACCGTGTGCTGCTGAGCCGGGTTCCAGAGCTGCT
TTCAGGTTCCGCACCAGGGAAGTAATAGAGGAAGGTTCGGCTGGTGGTCGTGAATGTGGC
GCCACATTGCAGCGCTCTACGTGCGTTGTGACTGAGCCACGATGGATACTGGGCGAGTGG
TCTGTGTGCGCTCCGAGACGAGCTCTATGTGGACGAGCCATTATCAATAGGACTGTTATG
TGCATAGATGCGGATGGGAATAAATTGGAGGACACACAGTGTGAGGCGGCCGGCGCTGGT
CCTGCGCCCTCTCGCGATGCGACATGTCGGGCTCCGTGCCCTTCTGACTGTGTTGTCAGC
TCTTGGTCAGACTGGAGTCCATGTGAACAGACGAAATGGGGCGGTCGTCGTGATAGGACT
CGTGTGGTTCTCCGCGCGGCTGCTGAGGGCGGGACTGCCTGCCCTCACCTGGTGGCTGCG
GAGCCTTGTTCACCGCACGCCTACTCCTGGCACGTGGCACCCTGGGATGACTGTCAACCG
CTGGGTGGGTCTCCGTGTGGGGAAGGAACAAAGAGAAGAGCTGTACGGTGCCTTCGCAGC
GATGGTGTTTTCGTAAATGATTCATTCTGTCCGAACGCAACGGCATCCGAGGCTCGGGAG
TCATGGTGCTACGTTCCATGTGGCGTAGACTGTGAGGTTGGAGAGTGGGGACCTTGGGAC
GCCTCCGCCTGCTCCTGCGGGGACGCAGTCACAGCACGCCACATGAGACGGATACGTCAA
CACTTGACGGCAGCTGTATGGCCGGGTCGCGCGTGCCCTCCCACTGAGCAACGAGCTCCC
TGCCCGCGAGAACCATGCTTGAGACTCGTCGCTAGACCGCTATTAGGATGTCATGTACAA
ACGTCATCAGGAGAAGAAGCTGATAATGCATGCGGATGGGGAGTGAAGTTATCTCATGCA
AGATGTGAACTGACTAGCATCAACGATGAACCGTCATCAGGAGCCTTCTTACAACCCTGG
AGATGTGCCTCCGCTCTACCGGGACGTATCGTTACACCGCCAATGCATCATCAGGAGGAC
GAGGAGTGTGAGGTCGAATGTGGATGCCAGGAATCTGAGCTGGGGCAGCCGGGTCCGTGG
GGCGCTTGGGGCGGCTGCCGTGGTGGGGCACGTTCGAGGACACGTACACTACTGGTACCA
CCCCGAAGAGCCTGCAGAACATCCTCCAGATACATAACAATCGAGTGGTCGAACTGCACC
GAGGAGGCTTCGGAGGCGACAGCTGGTGGTGACGGAACGCGAGGCGCCTGGCTTTCAGAA
CACTACCATGACGGATATATAGAGGGAAGTACTTCAGTGTTGGCGGTAGTGTGGACTGCG
ACCATAATACTCAGCTTGTATGGCGCGTTCATGCTCTATCGTGGACTTCTAAGATGCATC
AGAAGCAGAAAAATGAAGAGCATCACTAAAGTGTAA
Protein sequence:
MHALVILLLAAAVAADEVTDPNSLLDPAPEPVVIEGEYTVYVGTWTECSTGDGDQAKSFD
RYNARLESDSLVHTPRLGLQRRQVQCRRKDGRFVEALYCGIAIANIGTTRVCVMREDCSL
AEWLPWRPRSDGALVRTRRLRRLSQGGGKECDVVEEVRPTVLETTAHWTPGPWGPCRVAV
EQAATAAPTDDDDDADDNDAIYDEDDESDESEQSSCGGGVQRRAATCVRADGRALHDAQC
AHAVMPTLVQPCEVPCPRDCEVGEWSEWGACQPTDGCPLYPVQQLTTTGYSVRRRRVTAA
ASGGGAPCPPLEEKRTCTTPRCAAWKALPWGPCVLTQPHTSCGPGRRTRELRCMGHDGKE
AQRAWCNTGAPPRSERCRIACPGDCVVSAWAEWSPCSASCVAPGHARPTRTRRRHILAHA
APNGWPCPSEDQLIQNETCNTHACATYSWLATPWGPCERRRQDFIPATNYTDLLDNEPFN
ESDDEEPCIEEGEMSRDVMCVQNNADVVREALCAPLRRPASRRACTVRCRRGCRVEAWMP
WSPCPDTCDPGKQVRVRTVRGGPNCGPLQETRDCPVSRSCRSREAVWVAGEWSTCRLPPG
QRCGVGYRIRSIWCGSDSHRVEAGACAGARVPPAAAACSVTCDTIVPLTCDIICSDPLKY
LDASDPDVPSCVCKNVSLELLPADSDCILPPGIECGEGRSLRAARCLVGRRDVPMDVCRK
YHPLTGPRRVREAATDGFTYDEEFTSLLRGACSVRCARDCAVGAWAAWGPCAAEPGSRAA
FRFRTREVIEEGSAGGRECGATLQRSTCVVTEPRWILGEWSVCAPRRALCGRAIINRTVM
CIDADGNKLEDTQCEAAGAGPAPSRDATCRAPCPSDCVVSSWSDWSPCEQTKWGGRRDRT
RVVLRAAAEGGTACPHLVAAEPCSPHAYSWHVAPWDDCQPLGGSPCGEGTKRRAVRCLRS
DGVFVNDSFCPNATASEARESWCYVPCGVDCEVGEWGPWDASACSCGDAVTARHMRRIRQ
HLTAAVWPGRACPPTEQRAPCPREPCLRLVARPLLGCHVQTSSGEEADNACGWGVKLSHA
RCELTSINDEPSSGAFLQPWRCASALPGRIVTPPMHHQEDEECEVECGCQESELGQPGPW
GAWGGCRGGARSRTRTLLVPPRRACRTSSRYITIEWSNCTEEASEATAGGDGTRGAWLSE
HYHDGYIEGSTSVLAVVWTATIILSLYGAFMLYRGLLRCIRSRKMKSITKV
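
The lengths in this record can be cross-checked with simple arithmetic. The sketch below is only an illustration: the helper names `expected_cds_length` and `genomic_span` are hypothetical, the protein length of 1251 is a count of the residues listed under "Protein sequence", and the genomic coordinates are assumed to be 1-based and inclusive.

```python
def expected_cds_length(protein_length: int) -> int:
    """Nucleotides needed to encode the protein plus one stop codon."""
    return 3 * (protein_length + 1)


def genomic_span(start: int, end: int) -> int:
    """Length of a genomic interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


# Values taken from the record: "CDS Length" and "Genomic Position".
cds_length = 3756
protein_length = 1251  # residues counted in the protein sequence

# True when the CDS length equals 3 * (residues + stop codon).
cds_consistent = expected_cds_length(protein_length) == cds_length

# Span of the gene model on scaffold640 (assumed inclusive coordinates).
span = genomic_span(100226, 130887)

print(cds_consistent, span)
```

Under these assumptions the CDS length agrees with the protein length (1251 codons plus the terminal TAA stop), and the genomic span is considerably longer than the CDS, which would be consistent with the model containing introns.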