DPGLEAN00876 in OGS1.0

New model in OGS2.0DPOGS204081 
Genomic Positionscaffold2858:+ 23847-37772
See gene structure
CDS Length3597
Paired RNAseq reads  2579
Single RNAseq reads  6797
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA010816 (3e-08)
Best Drosophila hit  distracted (1e-66)
Best Human hitattractin isoform 2 (2e-52)
Best NR hit (blastp)  PREDICTED: similar to attractin [Nasonia vitripennis] (0.0)
Best NR hit (blastx)  attractin [Aedes aegypti] (3e-142)
GeneOntology terms  GO:0016020 membrane
InterPro families




  
IPR000859 CUB
IPR016201 Plexin-like fold
IPR015915 Kelch-type beta propeller
IPR003659 Plexin/semaphorin/integrin
IPR002049 EGF-like, laminin
IPR002165 Plexin
Orthology groupMCL10837

Nucleotide sequence:

ATGGTAGAATCATTGCAAATGTTTTTATTTCTTTTCAAATCAAAATACAGACGAAAATAT
TCGTGGTTCTCGCCGTTCTTGTGTTCAGTGCTCATAGTGTTATTGTTTTGTCATGGTGTA
CTATCGAAGTGTAGTGATCACAACTGTTTTAACGGTGTATGTAACAATGATACCTGCGTG
TGCTACGAGGGCTGGCAGGGCTCCCAGTGCCAACACTGCGGCGGGAAGATTAAATTGACG
GAGACGTCTGGTGTTATAACCGATGGTCCCGGTAATTATAGCGTTAGTACACAATGCTCG
TGGTTGATCACACCGCCACGCGTGGGGCCCACGTTGCCCACTGTGCGGGTGACACTGGAG
AGTTTTGCCACGGAGTGCGGATGGGATCATTTGTATGTATACGATGGTGATAGTGTCCGA
GCTGAGAAACTATTGGCTGTGTTTAGCGGGGTTTTAGATAAGAACGAGTCTAACTGGACT
CGCCAGGTTATAGCGCGGTCAGGTAGTGTTCTTTTGCATTTCTTCTCTGACGATGCTTAC
GCCATGGAAGGTTTTAATGTCACCTACGATGCCTACTCCTGCCCATCCAACGACCACAGG
ACCAACTGTTCCGATCACGGCGAGTGCGAGGAAGGTTCCTGTCGGTGTGACGATGACTGG
CTCGGGGTAGCCTGTGACCAGCCTTTGTGTCCCAACGACTGTAACGCTATGTACGGAGCT
GGGTCGTGTACGTCGTCTGGCTGCGTGTGCACGCCGTCCAAGACTGGAGCAGATTGCAGC
CGGGACGCGTTTATATCCGGCTGGGGGTGGGCGTGGCGGGAGGAGGGGGAGGGGGGTGAA
CGCCCGCGGAACATGCCGCCGCCGACAGCTGGGCACGTGCTTGTCAACTATGGTGACGAT
ATAATAATGGTGGGAGGGGAAATGTTCCAAGACGCAGCGTTTATGTACAGATATAAACCG
AGCCTTAAGGAGTGGAAGGTAGTTGAGGCCCGGGGCAAGGCGCCACAGATGCGGTTCGCC
CACACGGCCATAGTCCACGGCGAGGAGATCATAGTGTATGGCGGGGTGGTGGTCACCGAC
GAGCTGGAGAGGAGCGGGGGTCTCGCGGGGGTGGAGGGGCGGGCGGCGTTCGTCAGCAAC
GAGATCTGGACGGGTCGGCTGTCGGGGGGCTTCGTCCACTGGACCAACGACACGCCGCGG
ACGTGCTCTCCTCACCACCCCGCGCCGTTCGACCACTGCGTACTCCTTACATACTTTCCG
TCCCGCACCCCAGGCGGGCTCCACCTGTCGGGCCACTCGTCAGTCCTGGTTCAAATCGGC
CCAACCTCCAAGCCCGTGATGTTGGTGTTCTTCGGCCACTCCCCCCACTACGGCTACCTG
CATCTCGTACAGGAAACATTGATATGGGAACTATATCTTGGAGACGCTCAGGCCAGCAGC
GGGGGTCTCGCGGGGATGGAGGGGCGGGCGGCGTTCGTCAGCAACGAGATCTGGACGGGT
CGGCTGTCGGGGGGCTTCGTCCACTGGACCAACGACACGCCGCGGACGTGCTCTCCTCAC
CACCCCGCGCCGTTCGACCACTGCGGCGGGCTCCACCTGTCGGGCCACTCGTCAGTCCTG
GTCCAAATCGGCCCAACCTCCAAGCCCGTGATGTTGGTGTTCTTCGGCCACTCCCCCCAC
TACGGCTACCTGCATCTCGTACAGGAGTACTATATAGAGGAGAAGGCGTGGGGCGTCGCC
CGGACCCGCGGCTGGCCGGCCAGGGGAGGGTTCGCTCACACCGCTGTATGGGACGCGCTC
AGCGGCAGGGTGTACGTACACGCTGGACTCGTCTCCGAGTCGGAGGCGACACAGGCGCCG
TCCGCCGCGCTGTACGAGTATGAAGTTGAAGCACGGATATGGCGCCCGCTGCCCTCCGCC
CCCACGCCCAGATATCTACACACCGCCATATTTATATCGCCAGGGGTCATGTTGGTGTTC
GGGGGGAACGCCCACAACGACAGTGCTGCCGCGGCGCTCACGGCCTCGGGCGCGTCCCAG
TGCTACGCGGCCAACGCGCTGCTGTACTACGCCAGGTGTCGCCAGTGGATGTCTGCGGGC
GGCCTGCTGGGCTCGCCTCGCGCCGGACACGCGGCCGCTCTGCTTCCGGCCAAGAGACCC
ACCGTCATCATACATGGCGGCTTCGACGGCCGCCTTCGCTCGGACGCGCTTGTCTTCGAG
TCCGGAATGCGCTGTTCGTGGTACAAGGACGAAACGTCCTGTATGAACAGCGCCAGGCAC
GGCGTCTCGTGTGTGTGGCGCCTTAGAGATATGCTGTGCGTCGGGATAAAGGAAGTAGGG
TGGAAGGATTCTTTCACGGATGCTGTAAAAGCCTGCATCGACGAGCCAGTAGTCGATGAG
TCTCGCTGTGCCGTGTCTTCGTGTGAAGCTTGTACAGCGCTTGGTTGTGCTTGGTGCGGC
TCGTGTCTCCCGTCCGCGTATCACTGTCGACGATCCCGGACGGCACACGGACCGGTGACC
CTGTCCGTGTCGGAGTGTCCGCCGAGCGGCGCGTCGTGTTCGCGCTACCACTCGTGCGCC
GCGTGTCACGCGCATCTACACAGACACCCTCATGGCTCGGAAGACTTAAACCAACGGGCG
TGTTACTGGGACTATGACACGGTGAAGTGCCGGCCGGCCAATGCGACCACGGATATAAGG
GGCTCGCCGAGTGTGTCGGGGTCGTGCAGCGCCGCGTGCTCGTCCTATACCACATGCGGG
AACTGCACCGCTGAAGAGTGCATCTGGTGCGCCTCCGCCGGGAGGTGCGTGGATAAGAAC
GCTTACGGAGCTTCGTTTCCGCTGGGCGGGTGTCGCGCGTGGTCCACCAGCGGCTGTGGA
GGTGTGGGGGTGACGGGGGGTGTCCCGGGGGGCGGCTGCTCGTCGCACGTGTCGTGTCGC
TCGTGTCTGTCGGAGCCCGCGTGCGGCTGGTGTGATGACGGCGCGGGCGGCGGGCGAGGA
GCCTGTCTGCCGGGAGGTGACCGTCACCCCCACCATCCCCACATCTGTCCCAGGAGACGA
TGGCACTTCACGTCGTGTCCGTCGTGTCAGTGTAACGGCCACTCGGTGTGCGACGCGGCG
TCCCGTTGTGTCCAGCCGTGCGGGTCCCGGGCCGTGGGCCCCCACTGTGACACTTGCGCC
CCCGCGCACTGGGGTACCCCGCTCAACGGGGGGGTCTGCACGCCGTGTGAGTGTAACGCC
CAGGCCGTGTCGTGCGCGGCGGACACGGGCCGCTGTTTCTGCAGCACCAAGGGCCTGGCG
GGCGACAGGTGCGACAAGTGTGACAACACCAACCACTACCACGCCGACGTCTACAACAAG
GGCTGCTACTACGACCTAGCCGTCGACTATCAGTTCACCTTCAACCTGTCCAAGAAGGAG
GATCGTCATTTGTCCGCCATTAACTTCCGGAACGCTCCCGTCAAACCGGACGTGGACGCT
GACTTCAGTATCACGTGTTCCGCCCACGCCAGGATGAATCTCACCGTCAGGACCAAATCT
GATCCTGAGAGGACGTTATTCAGTGACGTCAATTGCACCAATTTTAGATACAAGTGA

Protein sequence:

MVESLQMFLFLFKSKYRRKYSWFSPFLCSVLIVLLFCHGVLSKCSDHNCFNGVCNNDTCV
CYEGWQGSQCQHCGGKIKLTETSGVITDGPGNYSVSTQCSWLITPPRVGPTLPTVRVTLE
SFATECGWDHLYVYDGDSVRAEKLLAVFSGVLDKNESNWTRQVIARSGSVLLHFFSDDAY
AMEGFNVTYDAYSCPSNDHRTNCSDHGECEEGSCRCDDDWLGVACDQPLCPNDCNAMYGA
GSCTSSGCVCTPSKTGADCSRDAFISGWGWAWREEGEGGERPRNMPPPTAGHVLVNYGDD
IIMVGGEMFQDAAFMYRYKPSLKEWKVVEARGKAPQMRFAHTAIVHGEEIIVYGGVVVTD
ELERSGGLAGVEGRAAFVSNEIWTGRLSGGFVHWTNDTPRTCSPHHPAPFDHCVLLTYFP
SRTPGGLHLSGHSSVLVQIGPTSKPVMLVFFGHSPHYGYLHLVQETLIWELYLGDAQASS
GGLAGMEGRAAFVSNEIWTGRLSGGFVHWTNDTPRTCSPHHPAPFDHCGGLHLSGHSSVL
VQIGPTSKPVMLVFFGHSPHYGYLHLVQEYYIEEKAWGVARTRGWPARGGFAHTAVWDAL
SGRVYVHAGLVSESEATQAPSAALYEYEVEARIWRPLPSAPTPRYLHTAIFISPGVMLVF
GGNAHNDSAAAALTASGASQCYAANALLYYARCRQWMSAGGLLGSPRAGHAAALLPAKRP
TVIIHGGFDGRLRSDALVFESGMRCSWYKDETSCMNSARHGVSCVWRLRDMLCVGIKEVG
WKDSFTDAVKACIDEPVVDESRCAVSSCEACTALGCAWCGSCLPSAYHCRRSRTAHGPVT
LSVSECPPSGASCSRYHSCAACHAHLHRHPHGSEDLNQRACYWDYDTVKCRPANATTDIR
GSPSVSGSCSAACSSYTTCGNCTAEECIWCASAGRCVDKNAYGASFPLGGCRAWSTSGCG
GVGVTGGVPGGGCSSHVSCRSCLSEPACGWCDDGAGGGRGACLPGGDRHPHHPHICPRRR
WHFTSCPSCQCNGHSVCDAASRCVQPCGSRAVGPHCDTCAPAHWGTPLNGGVCTPCECNA
QAVSCAADTGRCFCSTKGLAGDRCDKCDNTNHYHADVYNKGCYYDLAVDYQFTFNLSKKE
DRHLSAINFRNAPVKPDVDADFSITCSAHARMNLTVRTKSDPERTLFSDVNCTNFRYK