New model in OGS2.0 | DPOGS204081  |
---|---|
Genomic Position | scaffold2858:+ 23847-37772 |
CDS Length | 3597 |
Paired RNAseq reads   | 2579 |
Single RNAseq reads   | 6797 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA010816 (3e-08) |
Best Drosophila hit   | distracted (1e-66) |
Best Human hit | attractin isoform 2 (2e-52) |
Best NR hit (blastp)   | PREDICTED: similar to attractin [Nasonia vitripennis] (0.0) |
Best NR hit (blastx)   | attractin [Aedes aegypti] (3e-142) |
GeneOntology terms   | GO:0016020 membrane |
InterPro families    | IPR000859 CUB; IPR016201 Plexin-like fold; IPR015915 Kelch-type beta propeller; IPR003659 Plexin/semaphorin/integrin; IPR002049 EGF-like, laminin; IPR002165 Plexin |
Orthology group | MCL10837 |
Nucleotide sequence:
ATGGTAGAATCATTGCAAATGTTTTTATTTCTTTTCAAATCAAAATACAGACGAAAATAT
TCGTGGTTCTCGCCGTTCTTGTGTTCAGTGCTCATAGTGTTATTGTTTTGTCATGGTGTA
CTATCGAAGTGTAGTGATCACAACTGTTTTAACGGTGTATGTAACAATGATACCTGCGTG
TGCTACGAGGGCTGGCAGGGCTCCCAGTGCCAACACTGCGGCGGGAAGATTAAATTGACG
GAGACGTCTGGTGTTATAACCGATGGTCCCGGTAATTATAGCGTTAGTACACAATGCTCG
TGGTTGATCACACCGCCACGCGTGGGGCCCACGTTGCCCACTGTGCGGGTGACACTGGAG
AGTTTTGCCACGGAGTGCGGATGGGATCATTTGTATGTATACGATGGTGATAGTGTCCGA
GCTGAGAAACTATTGGCTGTGTTTAGCGGGGTTTTAGATAAGAACGAGTCTAACTGGACT
CGCCAGGTTATAGCGCGGTCAGGTAGTGTTCTTTTGCATTTCTTCTCTGACGATGCTTAC
GCCATGGAAGGTTTTAATGTCACCTACGATGCCTACTCCTGCCCATCCAACGACCACAGG
ACCAACTGTTCCGATCACGGCGAGTGCGAGGAAGGTTCCTGTCGGTGTGACGATGACTGG
CTCGGGGTAGCCTGTGACCAGCCTTTGTGTCCCAACGACTGTAACGCTATGTACGGAGCT
GGGTCGTGTACGTCGTCTGGCTGCGTGTGCACGCCGTCCAAGACTGGAGCAGATTGCAGC
CGGGACGCGTTTATATCCGGCTGGGGGTGGGCGTGGCGGGAGGAGGGGGAGGGGGGTGAA
CGCCCGCGGAACATGCCGCCGCCGACAGCTGGGCACGTGCTTGTCAACTATGGTGACGAT
ATAATAATGGTGGGAGGGGAAATGTTCCAAGACGCAGCGTTTATGTACAGATATAAACCG
AGCCTTAAGGAGTGGAAGGTAGTTGAGGCCCGGGGCAAGGCGCCACAGATGCGGTTCGCC
CACACGGCCATAGTCCACGGCGAGGAGATCATAGTGTATGGCGGGGTGGTGGTCACCGAC
GAGCTGGAGAGGAGCGGGGGTCTCGCGGGGGTGGAGGGGCGGGCGGCGTTCGTCAGCAAC
GAGATCTGGACGGGTCGGCTGTCGGGGGGCTTCGTCCACTGGACCAACGACACGCCGCGG
ACGTGCTCTCCTCACCACCCCGCGCCGTTCGACCACTGCGTACTCCTTACATACTTTCCG
TCCCGCACCCCAGGCGGGCTCCACCTGTCGGGCCACTCGTCAGTCCTGGTTCAAATCGGC
CCAACCTCCAAGCCCGTGATGTTGGTGTTCTTCGGCCACTCCCCCCACTACGGCTACCTG
CATCTCGTACAGGAAACATTGATATGGGAACTATATCTTGGAGACGCTCAGGCCAGCAGC
GGGGGTCTCGCGGGGATGGAGGGGCGGGCGGCGTTCGTCAGCAACGAGATCTGGACGGGT
CGGCTGTCGGGGGGCTTCGTCCACTGGACCAACGACACGCCGCGGACGTGCTCTCCTCAC
CACCCCGCGCCGTTCGACCACTGCGGCGGGCTCCACCTGTCGGGCCACTCGTCAGTCCTG
GTCCAAATCGGCCCAACCTCCAAGCCCGTGATGTTGGTGTTCTTCGGCCACTCCCCCCAC
TACGGCTACCTGCATCTCGTACAGGAGTACTATATAGAGGAGAAGGCGTGGGGCGTCGCC
CGGACCCGCGGCTGGCCGGCCAGGGGAGGGTTCGCTCACACCGCTGTATGGGACGCGCTC
AGCGGCAGGGTGTACGTACACGCTGGACTCGTCTCCGAGTCGGAGGCGACACAGGCGCCG
TCCGCCGCGCTGTACGAGTATGAAGTTGAAGCACGGATATGGCGCCCGCTGCCCTCCGCC
CCCACGCCCAGATATCTACACACCGCCATATTTATATCGCCAGGGGTCATGTTGGTGTTC
GGGGGGAACGCCCACAACGACAGTGCTGCCGCGGCGCTCACGGCCTCGGGCGCGTCCCAG
TGCTACGCGGCCAACGCGCTGCTGTACTACGCCAGGTGTCGCCAGTGGATGTCTGCGGGC
GGCCTGCTGGGCTCGCCTCGCGCCGGACACGCGGCCGCTCTGCTTCCGGCCAAGAGACCC
ACCGTCATCATACATGGCGGCTTCGACGGCCGCCTTCGCTCGGACGCGCTTGTCTTCGAG
TCCGGAATGCGCTGTTCGTGGTACAAGGACGAAACGTCCTGTATGAACAGCGCCAGGCAC
GGCGTCTCGTGTGTGTGGCGCCTTAGAGATATGCTGTGCGTCGGGATAAAGGAAGTAGGG
TGGAAGGATTCTTTCACGGATGCTGTAAAAGCCTGCATCGACGAGCCAGTAGTCGATGAG
TCTCGCTGTGCCGTGTCTTCGTGTGAAGCTTGTACAGCGCTTGGTTGTGCTTGGTGCGGC
TCGTGTCTCCCGTCCGCGTATCACTGTCGACGATCCCGGACGGCACACGGACCGGTGACC
CTGTCCGTGTCGGAGTGTCCGCCGAGCGGCGCGTCGTGTTCGCGCTACCACTCGTGCGCC
GCGTGTCACGCGCATCTACACAGACACCCTCATGGCTCGGAAGACTTAAACCAACGGGCG
TGTTACTGGGACTATGACACGGTGAAGTGCCGGCCGGCCAATGCGACCACGGATATAAGG
GGCTCGCCGAGTGTGTCGGGGTCGTGCAGCGCCGCGTGCTCGTCCTATACCACATGCGGG
AACTGCACCGCTGAAGAGTGCATCTGGTGCGCCTCCGCCGGGAGGTGCGTGGATAAGAAC
GCTTACGGAGCTTCGTTTCCGCTGGGCGGGTGTCGCGCGTGGTCCACCAGCGGCTGTGGA
GGTGTGGGGGTGACGGGGGGTGTCCCGGGGGGCGGCTGCTCGTCGCACGTGTCGTGTCGC
TCGTGTCTGTCGGAGCCCGCGTGCGGCTGGTGTGATGACGGCGCGGGCGGCGGGCGAGGA
GCCTGTCTGCCGGGAGGTGACCGTCACCCCCACCATCCCCACATCTGTCCCAGGAGACGA
TGGCACTTCACGTCGTGTCCGTCGTGTCAGTGTAACGGCCACTCGGTGTGCGACGCGGCG
TCCCGTTGTGTCCAGCCGTGCGGGTCCCGGGCCGTGGGCCCCCACTGTGACACTTGCGCC
CCCGCGCACTGGGGTACCCCGCTCAACGGGGGGGTCTGCACGCCGTGTGAGTGTAACGCC
CAGGCCGTGTCGTGCGCGGCGGACACGGGCCGCTGTTTCTGCAGCACCAAGGGCCTGGCG
GGCGACAGGTGCGACAAGTGTGACAACACCAACCACTACCACGCCGACGTCTACAACAAG
GGCTGCTACTACGACCTAGCCGTCGACTATCAGTTCACCTTCAACCTGTCCAAGAAGGAG
GATCGTCATTTGTCCGCCATTAACTTCCGGAACGCTCCCGTCAAACCGGACGTGGACGCT
GACTTCAGTATCACGTGTTCCGCCCACGCCAGGATGAATCTCACCGTCAGGACCAAATCT
GATCCTGAGAGGACGTTATTCAGTGACGTCAATTGCACCAATTTTAGATACAAGTGA
Protein sequence:
MVESLQMFLFLFKSKYRRKYSWFSPFLCSVLIVLLFCHGVLSKCSDHNCFNGVCNNDTCV
CYEGWQGSQCQHCGGKIKLTETSGVITDGPGNYSVSTQCSWLITPPRVGPTLPTVRVTLE
SFATECGWDHLYVYDGDSVRAEKLLAVFSGVLDKNESNWTRQVIARSGSVLLHFFSDDAY
AMEGFNVTYDAYSCPSNDHRTNCSDHGECEEGSCRCDDDWLGVACDQPLCPNDCNAMYGA
GSCTSSGCVCTPSKTGADCSRDAFISGWGWAWREEGEGGERPRNMPPPTAGHVLVNYGDD
IIMVGGEMFQDAAFMYRYKPSLKEWKVVEARGKAPQMRFAHTAIVHGEEIIVYGGVVVTD
ELERSGGLAGVEGRAAFVSNEIWTGRLSGGFVHWTNDTPRTCSPHHPAPFDHCVLLTYFP
SRTPGGLHLSGHSSVLVQIGPTSKPVMLVFFGHSPHYGYLHLVQETLIWELYLGDAQASS
GGLAGMEGRAAFVSNEIWTGRLSGGFVHWTNDTPRTCSPHHPAPFDHCGGLHLSGHSSVL
VQIGPTSKPVMLVFFGHSPHYGYLHLVQEYYIEEKAWGVARTRGWPARGGFAHTAVWDAL
SGRVYVHAGLVSESEATQAPSAALYEYEVEARIWRPLPSAPTPRYLHTAIFISPGVMLVF
GGNAHNDSAAAALTASGASQCYAANALLYYARCRQWMSAGGLLGSPRAGHAAALLPAKRP
TVIIHGGFDGRLRSDALVFESGMRCSWYKDETSCMNSARHGVSCVWRLRDMLCVGIKEVG
WKDSFTDAVKACIDEPVVDESRCAVSSCEACTALGCAWCGSCLPSAYHCRRSRTAHGPVT
LSVSECPPSGASCSRYHSCAACHAHLHRHPHGSEDLNQRACYWDYDTVKCRPANATTDIR
GSPSVSGSCSAACSSYTTCGNCTAEECIWCASAGRCVDKNAYGASFPLGGCRAWSTSGCG
GVGVTGGVPGGGCSSHVSCRSCLSEPACGWCDDGAGGGRGACLPGGDRHPHHPHICPRRR
WHFTSCPSCQCNGHSVCDAASRCVQPCGSRAVGPHCDTCAPAHWGTPLNGGVCTPCECNA
QAVSCAADTGRCFCSTKGLAGDRCDKCDNTNHYHADVYNKGCYYDLAVDYQFTFNLSKKE
DRHLSAINFRNAPVKPDVDADFSITCSAHARMNLTVRTKSDPERTLFSDVNCTNFRYK
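
The record lists a CDS length alongside the nucleotide and protein sequences, so the three can be cross-checked against each other. The following is a minimal sketch of such a check, assuming the standard genetic code applies; the function names `translate` and `cds_matches` are hypothetical helpers written for this illustration and are not part of any database tooling.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a CDS string; whitespace and line breaks are ignored."""
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Codons containing ambiguous bases (e.g. N) are rendered as 'X'.
    return "".join(CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3))


def cds_matches(cds: str, protein: str, expected_len: int) -> bool:
    """Check that the CDS has the stated length, ends in a stop codon,
    and translates to the given protein (stop codon excluded)."""
    cds = "".join(cds.split()).upper()
    protein = "".join(protein.split()).upper()
    if len(cds) != expected_len:
        return False
    translated = translate(cds)
    return translated.endswith("*") and translated[:-1] == protein


# Usage with the first nucleotide line of this record plus a stop codon
# (a shortened toy input, not the full 3597-nt CDS).
first_line = "ATGGTAGAATCATTGCAAATGTTTTTATTTCTTTTCAAATCAAAATACAGACGAAAATAT"
print(translate(first_line))
print(cds_matches(first_line + "TGA", "MVESLQMFLFLFKSKYRRKY", 63))
```

To check the whole record, pass the full nucleotide block, the full protein block, and the listed CDS length (3597) to `cds_matches`; line breaks in the pasted sequences are stripped automatically.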