DPGLEAN04560 in OGS1.0

New model in OGS2.0DPOGS200340 
Genomic Positionscaffold640:- 100226-130887
See gene structure
CDS Length3756
Paired RNAseq reads  1007
Single RNAseq reads  2290
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA005633 (1e-07)
Best Drosophila hit  ND
Best Human hitthrombospondin type-1 domain-containing protein 7B (8e-54)
Best NR hit (blastp)  HM00052 [Heliconius melpomene] (0.0)
Best NR hit (blastx)  HM00052 [Heliconius melpomene] (7e-152)
GeneOntology terms
  
GO:0016021 integral to membrane
GO:0016020 membrane
InterPro families  IPR000884 Thrombospondin, type 1 repeat
Orthology groupMCL19291

Nucleotide sequence:

ATGCATGCATTAGTGATATTGCTGCTTGCAGCGGCAGTTGCAGCTGATGAAGTTACCGAC
CCAAACAGCCTTCTAGATCCGGCTCCAGAGCCGGTGGTCATTGAAGGGGAGTACACGGTG
TATGTGGGCACATGGACAGAATGTAGTACTGGCGACGGAGATCAAGCTAAGTCTTTCGAC
AGATATAACGCTCGTCTGGAATCCGATTCTCTGGTGCACACTCCACGTCTGGGACTTCAG
AGACGTCAAGTGCAATGTCGGAGAAAAGATGGCAGATTTGTAGAAGCTCTGTACTGCGGG
ATTGCCATAGCAAACATCGGCACAACTCGCGTGTGCGTGATGCGTGAGGACTGTTCCTTG
GCTGAATGGTTGCCATGGCGACCAAGATCTGATGGAGCTCTAGTAAGAACAAGACGTTTA
CGAAGACTGTCACAAGGCGGAGGCAAAGAATGCGACGTGGTCGAAGAAGTACGACCTACT
GTTTTGGAGACTACGGCACATTGGACTCCGGGTCCTTGGGGACCATGCCGTGTTGCTGTG
GAACAAGCGGCCACTGCTGCACCAACTGATGATGACGATGACGCGGATGACAATGACGCG
ATTTACGACGAGGATGACGAGAGTGACGAAAGCGAGCAGTCAAGCTGTGGGGGCGGAGTG
CAGCGACGTGCTGCGACCTGCGTCCGGGCAGACGGGCGAGCGTTGCACGACGCGCAGTGT
GCTCATGCAGTTATGCCGACACTCGTGCAACCTTGCGAGGTACCTTGCCCACGTGATTGT
GAAGTTGGAGAATGGAGTGAATGGGGTGCCTGTCAGCCTACTGACGGATGTCCTCTTTAC
CCAGTACAACAACTCACAACTACTGGGTACAGCGTACGTCGTCGAAGAGTAACAGCAGCA
GCATCCGGTGGAGGGGCGCCATGTCCTCCTCTAGAGGAAAAACGTACTTGCACTACACCA
AGATGTGCTGCGTGGAAAGCACTGCCGTGGGGCCCCTGTGTATTGACTCAACCCCATACT
AGTTGTGGACCAGGCCGACGTACCAGAGAACTTAGATGCATGGGACACGATGGAAAGGAA
GCTCAACGAGCGTGGTGTAACACTGGCGCTCCACCACGCAGTGAACGATGTCGTATCGCT
TGTCCGGGAGACTGTGTAGTGTCTGCATGGGCGGAGTGGTCACCTTGTTCAGCGAGTTGC
GTCGCCCCCGGACATGCACGACCCACGCGTACTAGACGCAGACATATACTTGCCCACGCT
GCACCAAATGGCTGGCCCTGCCCATCTGAAGACCAACTGATTCAAAACGAGACATGCAAC
ACCCACGCTTGTGCCACTTATTCCTGGCTTGCAACGCCTTGGGGACCTTGTGAGCGTCGC
CGGCAAGATTTCATACCGGCTACCAATTACACAGATCTTCTGGATAATGAACCGTTCAAT
GAAAGCGATGACGAAGAACCTTGTATTGAAGAAGGTGAAATGAGCAGAGACGTTATGTGT
GTTCAGAATAATGCTGACGTTGTCAGAGAAGCTCTATGTGCTCCATTACGTCGCCCAGCG
TCCCGTCGGGCGTGTACTGTAAGATGTCGACGAGGTTGCAGGGTTGAAGCTTGGATGCCC
TGGTCTCCATGTCCTGATACTTGTGACCCTGGTAAGCAGGTCCGCGTTCGCACCGTCCGA
GGTGGTCCGAACTGCGGTCCGTTGCAGGAGACACGCGACTGTCCCGTTTCGAGGTCGTGT
CGTTCCCGTGAGGCTGTCTGGGTCGCCGGGGAGTGGAGTACCTGTAGATTACCGCCAGGA
CAACGCTGTGGAGTTGGCTATAGGATTAGAAGTATCTGGTGCGGCTCGGACTCTCACCGC
GTCGAGGCCGGCGCATGTGCTGGTGCCCGGGTGCCGCCCGCTGCAGCAGCCTGCAGCGTC
ACATGCGACACCATCGTACCACTCACTTGTGATATCATATGTTCAGATCCCCTAAAATAC
TTGGATGCCTCTGACCCCGACGTACCCTCATGCGTCTGCAAGAATGTCTCATTGGAACTG
TTACCCGCTGATTCAGACTGTATTCTTCCACCTGGAATTGAATGCGGTGAAGGGAGATCA
CTGCGGGCAGCTCGTTGTTTAGTTGGAAGACGTGATGTACCCATGGATGTTTGTAGGAAA
TACCATCCCCTTACAGGACCCCGTCGCGTTCGTGAAGCAGCAACAGACGGCTTCACATAT
GATGAGGAATTCACATCTTTATTACGCGGTGCATGTAGCGTGCGGTGTGCGAGGGACTGT
GCGGTCGGGGCGTGGGCTGCCTGGGGACCGTGTGCTGCTGAGCCGGGTTCCAGAGCTGCT
TTCAGGTTCCGCACCAGGGAAGTAATAGAGGAAGGTTCGGCTGGTGGTCGTGAATGTGGC
GCCACATTGCAGCGCTCTACGTGCGTTGTGACTGAGCCACGATGGATACTGGGCGAGTGG
TCTGTGTGCGCTCCGAGACGAGCTCTATGTGGACGAGCCATTATCAATAGGACTGTTATG
TGCATAGATGCGGATGGGAATAAATTGGAGGACACACAGTGTGAGGCGGCCGGCGCTGGT
CCTGCGCCCTCTCGCGATGCGACATGTCGGGCTCCGTGCCCTTCTGACTGTGTTGTCAGC
TCTTGGTCAGACTGGAGTCCATGTGAACAGACGAAATGGGGCGGTCGTCGTGATAGGACT
CGTGTGGTTCTCCGCGCGGCTGCTGAGGGCGGGACTGCCTGCCCTCACCTGGTGGCTGCG
GAGCCTTGTTCACCGCACGCCTACTCCTGGCACGTGGCACCCTGGGATGACTGTCAACCG
CTGGGTGGGTCTCCGTGTGGGGAAGGAACAAAGAGAAGAGCTGTACGGTGCCTTCGCAGC
GATGGTGTTTTCGTAAATGATTCATTCTGTCCGAACGCAACGGCATCCGAGGCTCGGGAG
TCATGGTGCTACGTTCCATGTGGCGTAGACTGTGAGGTTGGAGAGTGGGGACCTTGGGAC
GCCTCCGCCTGCTCCTGCGGGGACGCAGTCACAGCACGCCACATGAGACGGATACGTCAA
CACTTGACGGCAGCTGTATGGCCGGGTCGCGCGTGCCCTCCCACTGAGCAACGAGCTCCC
TGCCCGCGAGAACCATGCTTGAGACTCGTCGCTAGACCGCTATTAGGATGTCATGTACAA
ACGTCATCAGGAGAAGAAGCTGATAATGCATGCGGATGGGGAGTGAAGTTATCTCATGCA
AGATGTGAACTGACTAGCATCAACGATGAACCGTCATCAGGAGCCTTCTTACAACCCTGG
AGATGTGCCTCCGCTCTACCGGGACGTATCGTTACACCGCCAATGCATCATCAGGAGGAC
GAGGAGTGTGAGGTCGAATGTGGATGCCAGGAATCTGAGCTGGGGCAGCCGGGTCCGTGG
GGCGCTTGGGGCGGCTGCCGTGGTGGGGCACGTTCGAGGACACGTACACTACTGGTACCA
CCCCGAAGAGCCTGCAGAACATCCTCCAGATACATAACAATCGAGTGGTCGAACTGCACC
GAGGAGGCTTCGGAGGCGACAGCTGGTGGTGACGGAACGCGAGGCGCCTGGCTTTCAGAA
CACTACCATGACGGATATATAGAGGGAAGTACTTCAGTGTTGGCGGTAGTGTGGACTGCG
ACCATAATACTCAGCTTGTATGGCGCGTTCATGCTCTATCGTGGACTTCTAAGATGCATC
AGAAGCAGAAAAATGAAGAGCATCACTAAAGTGTAA

Protein sequence:

MHALVILLLAAAVAADEVTDPNSLLDPAPEPVVIEGEYTVYVGTWTECSTGDGDQAKSFD
RYNARLESDSLVHTPRLGLQRRQVQCRRKDGRFVEALYCGIAIANIGTTRVCVMREDCSL
AEWLPWRPRSDGALVRTRRLRRLSQGGGKECDVVEEVRPTVLETTAHWTPGPWGPCRVAV
EQAATAAPTDDDDDADDNDAIYDEDDESDESEQSSCGGGVQRRAATCVRADGRALHDAQC
AHAVMPTLVQPCEVPCPRDCEVGEWSEWGACQPTDGCPLYPVQQLTTTGYSVRRRRVTAA
ASGGGAPCPPLEEKRTCTTPRCAAWKALPWGPCVLTQPHTSCGPGRRTRELRCMGHDGKE
AQRAWCNTGAPPRSERCRIACPGDCVVSAWAEWSPCSASCVAPGHARPTRTRRRHILAHA
APNGWPCPSEDQLIQNETCNTHACATYSWLATPWGPCERRRQDFIPATNYTDLLDNEPFN
ESDDEEPCIEEGEMSRDVMCVQNNADVVREALCAPLRRPASRRACTVRCRRGCRVEAWMP
WSPCPDTCDPGKQVRVRTVRGGPNCGPLQETRDCPVSRSCRSREAVWVAGEWSTCRLPPG
QRCGVGYRIRSIWCGSDSHRVEAGACAGARVPPAAAACSVTCDTIVPLTCDIICSDPLKY
LDASDPDVPSCVCKNVSLELLPADSDCILPPGIECGEGRSLRAARCLVGRRDVPMDVCRK
YHPLTGPRRVREAATDGFTYDEEFTSLLRGACSVRCARDCAVGAWAAWGPCAAEPGSRAA
FRFRTREVIEEGSAGGRECGATLQRSTCVVTEPRWILGEWSVCAPRRALCGRAIINRTVM
CIDADGNKLEDTQCEAAGAGPAPSRDATCRAPCPSDCVVSSWSDWSPCEQTKWGGRRDRT
RVVLRAAAEGGTACPHLVAAEPCSPHAYSWHVAPWDDCQPLGGSPCGEGTKRRAVRCLRS
DGVFVNDSFCPNATASEARESWCYVPCGVDCEVGEWGPWDASACSCGDAVTARHMRRIRQ
HLTAAVWPGRACPPTEQRAPCPREPCLRLVARPLLGCHVQTSSGEEADNACGWGVKLSHA
RCELTSINDEPSSGAFLQPWRCASALPGRIVTPPMHHQEDEECEVECGCQESELGQPGPW
GAWGGCRGGARSRTRTLLVPPRRACRTSSRYITIEWSNCTEEASEATAGGDGTRGAWLSE
HYHDGYIEGSTSVLAVVWTATIILSLYGAFMLYRGLLRCIRSRKMKSITKV