DPGLEAN13949 in OGS1.0

New model in OGS2.0DPOGS213644 
Genomic Positionscaffold543:- 42481-48765
See gene structure
CDS Length5292
Paired RNAseq reads  2421
Single RNAseq reads  6017
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA004585 (0.0)
Best Drosophila hit  ND
Best Human hitND
Best NR hit (blastp)  vitellogenin [Actias selene] (0.0)
Best NR hit (blastx)  vitellogenin [Actias selene] (0.0)
GeneOntology terms

  
GO:0003674 molecular_function
GO:0005576 extracellular region
GO:0008150 biological_process
InterPro families




  
IPR015816 Vitellinogen, beta-sheet N-terminal
IPR011030 Vitellinogen, superhelical
IPR001747 Lipid transport protein, N-terminal
IPR001846 von Willebrand factor, type D domain
IPR015819 Lipid transport protein, beta-sheet shell
IPR015255 Vitellinogen, open beta-sheet
Orthology groupMCL11119

Nucleotide sequence:

ATGAAGCTGTTGGTCCTAGCGGCCACTATTGTGGTCGTTTCATCCGGTCAACTTAGTGAC
GTTGTTGTGGAGTCGCCATGGCCCTGGCAAGTCGGAAAATTATATCGCTACGATGTGGAA
ACACATACCCTGGCACGTTATTTAGACAGTTTCAGTTCGGGCAACGCTTTCAGAGCCAAG
TTTACAGTGCGAGCTAAGTCAGACGGCTACCTACAGGCACGGTTGGAGAATCCAGAATAT
GCCAAAGTCTACCAAAAGCTAGAACAACACGATCCCATGCCAGAAGATCTGAAATACGTA
CCTGTGGCAAATTTGGACAAACCGTTTGAGATTTACATAGAAGGTGGAAGAATACTTTCA
GTAAAATTACCATTTTCTGTAACACTCATGCAAGAAAACTTGATTAAAGGTCTAATCGGT
TCGCTTCAAGTAGATCTCACCAGTCATCGTAATGTAAAAAGCTCCCATGACACGTACGAT
ACTCAAGTACAACAGGGATTATTCCGCAAGATGGAAATCGATGTGACGGGCGATTGTGAA
ACTCTTTACACAGTATCTCCTGCTGCGTCTGAATGGAGACGTGAACTTCCCAGTTTTGCT
TCCGACGATGAACCAATTGAAATAACTAAAAGCAAGAACTACGGTCATTGTCATCATCGT
GTAGATTATCACTTCGGTGTACCGGAAGGAGCAGAATGGTCTGGCACTGCCCACAAAACT
GGAAAGGAACAATTCATAAATCGTGCTACGGTCTCAAGAATGCTGGTTGGGAAGAATGGT
CACATATACAAAGCTGAAACAACCAGTACGGTTACTGCTCATCCCCATTTATATGGGGAA
CAAAAGGCACAGGTACACGGCAAAGTGCGTTTTAATTTAATGTCGTATGAGGATGATAAT
GAACCGGCATGGGTATACCCCGAAGGTGCGCGTGAAGTTACCAACTTATTATATGCTTTG
ACGGCAAAACCAATTGATATCGGTGATAGTTCGTCGTCTGAAAAGTCTATAAAAATTGAG
AAACATCCGAGGCAACGTCGCTCCAGTCGTATGAAATCTTTCGTCTCCATAAATAAGAAG
ATTGTTACTGAAACACATGGATCTTCCAGTTCCAGTGAATCAGACTCAGTATATGTAAAT
GATGACATTCCCAATATCAACGAACCCGCCTATGCTTCTCTCTATATGAACCCAGATCTT
CATGGTGATAAGAAACAGAATCCCATGAATGCTCAGAAGCTTTTACAAGAAATCGCCCAA
CAATTGCAAAATCCGAACAATATGCCGAAAGCGGATACCTTATCCAAATTTAATATTCTA
GTTCGTGTCATCGCCAGTATGAGTTATGGACAGCTCGGTCTGACAAGCCGCAGCATTGAA
ATTGCTAAGTTGGCTAATGATGTCGTGAAGTCTAACATGTGGATGATCTACAGAGATGCT
GTCGCCCAAGCCGGTACTCTGCCCGCATTCCAACAGATAAAGGCTTGGATTGAAAGCAAA
AAATTAGAAGGAGAAGAGGCGGCGGAAGTTATTTCCGTGCTTGCAGTATCTCTAAGGTAT
CCCACGAAGGTGGTCATGAAACAATTCTTTGATCTCGCCATGAACCCCGAGGTAACTAAA
CAGATGTTCCTTAATGACACTGCACTAATCGCTGCTGCTAAATTAATAAACATGGGACAA
GTAAACAATGAAACTGTGCATCGTTACTATCCGACACATATGTACGGACGTCCATCACCT
AAGGAAGATGCCTTCGTGATTAATGAAATTCTTCCCCGTCTGAGTCAGGAGCTTCAACTG
GCTATTGAAAATGGGGATAGTCGAAAATCACAAGTATATATTAAGGCTATCGGCGAACTT
GGTCACCCAGCTATCCTGGATATATTTAAACCGTACCTTGAAGGCAAAATTCCGGCTTCA
ACTTATCTTAGAACCAGAATCATAGAACATCTCTATGTTCTGGCCAAAGGAAGGGATGAT
TATGTACGTGCTGTGTTATTTAGCGTTTTGAAAAATACTGCTGAACCATATGAAGTAAGA
GTAGCAGCCATCGATAAAATCTTTATGTCACGACCAAGTACAGCGATGATGATGGCAATG
GCACAAATGACTAAAGACGATCCTAGTATCCAAATCCGTGCAGCGCTTAAATCGGCAATT
ACATCTGCATCAGAACTTAAAAATCCAAGATTCCATGACCTGGCAAGAACAGCAGCAGCT
GTTAAGGATATGCTCACAAGTGAAGAGTTTGGTTTACAATACTCTGGTAAAAACTTCCTG
GAACACTACGACAGGGATGAGCAACCAAGTTCTATGTCAGTACTCTCAAGACTGGGAAGC
AAGGATAGTCTGCTTCCGAAATATTGGAGATATTCATGGAAAGGAAGAGACGGAGGTTGG
GATCAAGAAACAGTTATCTCAGGAGCTGCTTCAAGTTGGCAGGAACTATTTGATCTCTTC
GCAGATCAGATGTTTGGACAAAGAAAACCCGATCAATATCCCGAATACAATCCTAAATAC
TCCGCTGAAAAGATTGCTGACATGTTGAACGTAAAAAAAGACGACCGAGAATCATCAGAG
GGCTCATTTTATATAAATTTACTAAATCAGAGGAGATACTTTGCTTTCAATGAAAATGAT
GTTAAAGAATTAGGCATTAAATTTCGCGAGTACTTAACAAATCTCAAAGACGTTGCTAAG
CAATACACTAAAGTCGTTAACAGGAACCAAGTGTCAGTCATGTTCCCTATAGCTACAGGA
GTACCATTTATTTATAAATATAAGGAACCGGTTCTCCTACATGTTCGTACTGTAACTAAA
GGAAACGTTGATTTTAAGGATAGAGAGGAATATAGGTCTAGTGCTTCTATCAATAGCGAG
CTGCGGATAATTTACGCTGAAAATCATGATGGCAATGTTGGTTTTCTAGACACTCTTGGT
AATCAACTTGCAAGCGTTGGATTAGTGAGAAAAAGTCAACTTAATATTCCAATTAAAATA
GATCTTGAAATGAAATCTGGAGAAGCGAAGTTCCATTTAAGTCCAATGGAACCCGAACAA
GATAATACTATAGCTCATTACAGTGTTTGGCCATATTCCGCAAACCAAAAGAAGGACACT
TTAACACCTATTTCTCAGGATCCTATATCAAGAGTTATTATGAGACCCGAAAAAGTAGCC
CAGATTGATAGCAAGTTTGGACAAAACTTTGGATCCATATTCCAACTCCAGGGTTATTCT
TATTCTGAAGATTACAGGTACATAGGAGACATGCTGAAGTCCTACAATTATTTAACTAGT
ATTATCAGGATGTTCAAGCAAAAAGATATAGCTCAAACTCACTTTAATCTGAGGTACTTG
GGAAAGCAATCTAAGAACAAAGGAGTCACAATCACAGTAGCTTACGACACACTGTATAAT
CAGAAAGAAACAGGCGTTATGCCAATAACTGCATCGGATGTGAAGGACTCGACACCCAAC
AGTCCATCACGACGAGAGGAATTAATTAAACGTGTTATAGCTGGCATACAATCATCTAGA
GCCCACGTCGTTGATTTGAGCGCAAAATTCGAGACAGAACAAAAATTGGAGTATACTGCG
ACCCTTGCAATCGGCGCGAGTGTCGTCGATCAAAAAATTCAGTTTGCTTTATTTGCTGGT
AGAAACTCTGATCAATACGGATCAAATCAGTTAAATGCCGTAGGTAGAGTTACGAAACCA
TTGTCAGATTCCCCTATTAATTTCCAAAAAGCACTAGAAAAAGAACTGAAAATGGATTTT
GAGGCCGATATCCTTTACAACCAGAAAGAAAATATCCACATTCTTGGCTCTGCCGAAAGA
ACAAAGAGATATATAGAAGAACTTCAGAAAGAACCACAAGTAAAGAGATGTCTTGAAAAT
TATGCCAGAGGTAATTATTACCAACACGACTGTCATGAAGCGGTTGTTATGGCCCATGCT
CCAGACAACTTCAAATTCAGTGTAAGTTACAAAGACGTCAGCTCTGGGACTAAAAATGCT
GCAGCCTACGCTTACAGAATTTTAGACGGGCTTAATTTATGGAGATCGGATATTAATATG
GCAAAGACGTTACCTGCTGGAAAACTTGAATTGAACGTTGATGCTTTATACTGGACAAGA
AATTTAAATCTTATTGTAAATTCTCGTTTTGGGGAATTGCGGGTAAACAATATACCTATA
CCTGAAGTTACTTCTAGAGCTGTGTCTATGTACTTACCGATCAGCGCCTATGAGCGAATT
CTAAATTATTACACCTGGCATCAGTATCAACCATATTGCAGTGTGGACAGTAACAGGGTG
AGGACCTTCAGTAACCGTGAATATGATTACACGCTGTCACCTTCCTGGCACGTAGTGATG
CACGATGACAGACCCGGCAGAAACGAGGATTTAGTCGTGCTGTCCAGAAGACCTCAAGAA
ATGAAAATGCAAATATACTTATCTTACAGATCTTACACTGGCAAATACATAGAGATGGAA
GTTCAACCAGCCCCGGACACTCAACAGAAGCACTCTGTTCAAGTCAAGACCAATGCCAAA
AAAGTGTCTGAAGGAGAACTTACAACCTACTGGGACGACGTCAATGACAGTCCGTTACTT
GAATACTACAGTACTGGCGACAATGTCTTAATGATCAAATTGCGTGAGAATCGTCTCAGA
ATCGTGTATGACGGAGAAAGGAGCGTAGTTCTTTCGAGAGACAACCGCAAAAACATCAGG
GGAATTTGTGGAAGAATGAGCGGTGATCCTCGCGATGACTACCTAACACCTAGTGGTCTC
GTAGATAAACCAGAATACTATGGAGCTTCCTACGCTCTTATTGAAGACGAGAATGATCCC
AGAACACAAGAATTGCAATCGGAAGCTAAAAGAAAGGCGTACGAGCCAAGAAAACAATAC
ACCACAATCTTGCAATCTGATAACAAATGGCAAAATGCTATGCTCTCTTCGTCTGAAGAT
GATTGGGACTCTCAGATCGTATACAGGGCAAGGAACTATGGAAAGAGTAAGGGAAAATGT
AAAGTAGTCCCTCAAGTGCAGTATTATGAGAACCAATCACAGATCTGTATAACCACCAGT
TCCTTACCGTCCTGCCAGTCTTCCTGTAGCGGAGGCAGCTACAAGATTCAGTCGACACAA
GTTGTTTGCCGCTCCAAGCTGGACTCTCAATTCCAATCTTACAGAGATGAAATCAAACTA
GGCAAAAGTCCCAAAGTCAGCGGAGAGCCGCGAACTGTAGACTACAGAGTCCCTAGTTCT
TGCAAATCCTAA

Protein sequence:

MKLLVLAATIVVVSSGQLSDVVVESPWPWQVGKLYRYDVETHTLARYLDSFSSGNAFRAK
FTVRAKSDGYLQARLENPEYAKVYQKLEQHDPMPEDLKYVPVANLDKPFEIYIEGGRILS
VKLPFSVTLMQENLIKGLIGSLQVDLTSHRNVKSSHDTYDTQVQQGLFRKMEIDVTGDCE
TLYTVSPAASEWRRELPSFASDDEPIEITKSKNYGHCHHRVDYHFGVPEGAEWSGTAHKT
GKEQFINRATVSRMLVGKNGHIYKAETTSTVTAHPHLYGEQKAQVHGKVRFNLMSYEDDN
EPAWVYPEGAREVTNLLYALTAKPIDIGDSSSSEKSIKIEKHPRQRRSSRMKSFVSINKK
IVTETHGSSSSSESDSVYVNDDIPNINEPAYASLYMNPDLHGDKKQNPMNAQKLLQEIAQ
QLQNPNNMPKADTLSKFNILVRVIASMSYGQLGLTSRSIEIAKLANDVVKSNMWMIYRDA
VAQAGTLPAFQQIKAWIESKKLEGEEAAEVISVLAVSLRYPTKVVMKQFFDLAMNPEVTK
QMFLNDTALIAAAKLINMGQVNNETVHRYYPTHMYGRPSPKEDAFVINEILPRLSQELQL
AIENGDSRKSQVYIKAIGELGHPAILDIFKPYLEGKIPASTYLRTRIIEHLYVLAKGRDD
YVRAVLFSVLKNTAEPYEVRVAAIDKIFMSRPSTAMMMAMAQMTKDDPSIQIRAALKSAI
TSASELKNPRFHDLARTAAAVKDMLTSEEFGLQYSGKNFLEHYDRDEQPSSMSVLSRLGS
KDSLLPKYWRYSWKGRDGGWDQETVISGAASSWQELFDLFADQMFGQRKPDQYPEYNPKY
SAEKIADMLNVKKDDRESSEGSFYINLLNQRRYFAFNENDVKELGIKFREYLTNLKDVAK
QYTKVVNRNQVSVMFPIATGVPFIYKYKEPVLLHVRTVTKGNVDFKDREEYRSSASINSE
LRIIYAENHDGNVGFLDTLGNQLASVGLVRKSQLNIPIKIDLEMKSGEAKFHLSPMEPEQ
DNTIAHYSVWPYSANQKKDTLTPISQDPISRVIMRPEKVAQIDSKFGQNFGSIFQLQGYS
YSEDYRYIGDMLKSYNYLTSIIRMFKQKDIAQTHFNLRYLGKQSKNKGVTITVAYDTLYN
QKETGVMPITASDVKDSTPNSPSRREELIKRVIAGIQSSRAHVVDLSAKFETEQKLEYTA
TLAIGASVVDQKIQFALFAGRNSDQYGSNQLNAVGRVTKPLSDSPINFQKALEKELKMDF
EADILYNQKENIHILGSAERTKRYIEELQKEPQVKRCLENYARGNYYQHDCHEAVVMAHA
PDNFKFSVSYKDVSSGTKNAAAYAYRILDGLNLWRSDINMAKTLPAGKLELNVDALYWTR
NLNLIVNSRFGELRVNNIPIPEVTSRAVSMYLPISAYERILNYYTWHQYQPYCSVDSNRV
RTFSNREYDYTLSPSWHVVMHDDRPGRNEDLVVLSRRPQEMKMQIYLSYRSYTGKYIEME
VQPAPDTQQKHSVQVKTNAKKVSEGELTTYWDDVNDSPLLEYYSTGDNVLMIKLRENRLR
IVYDGERSVVLSRDNRKNIRGICGRMSGDPRDDYLTPSGLVDKPEYYGASYALIEDENDP
RTQELQSEAKRKAYEPRKQYTTILQSDNKWQNAMLSSSEDDWDSQIVYRARNYGKSKGKC
KVVPQVQYYENQSQICITTSSLPSCQSSCSGGSYKIQSTQVVCRSKLDSQFQSYRDEIKL
GKSPKVSGEPRTVDYRVPSSCKS