New model in OGS2.0 | DPOGS213644  |
---|---|
Genomic Position | scaffold543:- 42481-48765 |
See gene structure | |
CDS Length | 5292 |
Paired RNAseq reads   | 2421 |
Single RNAseq reads   | 6017 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA004585 (0.0) |
Best Drosophila hit   | ND |
Best Human hit | ND |
Best NR hit (blastp)   | vitellogenin [Actias selene] (0.0) |
Best NR hit (blastx)   | vitellogenin [Actias selene] (0.0) |
GeneOntology terms    | GO:0003674 molecular_function GO:0005576 extracellular region GO:0008150 biological_process |
InterPro families    | IPR015816 Vitellinogen, beta-sheet N-terminal IPR011030 Vitellinogen, superhelical IPR001747 Lipid transport protein, N-terminal IPR001846 von Willebrand factor, type D domain IPR015819 Lipid transport protein, beta-sheet shell IPR015255 Vitellinogen, open beta-sheet |
Orthology group | MCL11119 |
Nucleotide sequence:
ATGAAGCTGTTGGTCCTAGCGGCCACTATTGTGGTCGTTTCATCCGGTCAACTTAGTGAC
GTTGTTGTGGAGTCGCCATGGCCCTGGCAAGTCGGAAAATTATATCGCTACGATGTGGAA
ACACATACCCTGGCACGTTATTTAGACAGTTTCAGTTCGGGCAACGCTTTCAGAGCCAAG
TTTACAGTGCGAGCTAAGTCAGACGGCTACCTACAGGCACGGTTGGAGAATCCAGAATAT
GCCAAAGTCTACCAAAAGCTAGAACAACACGATCCCATGCCAGAAGATCTGAAATACGTA
CCTGTGGCAAATTTGGACAAACCGTTTGAGATTTACATAGAAGGTGGAAGAATACTTTCA
GTAAAATTACCATTTTCTGTAACACTCATGCAAGAAAACTTGATTAAAGGTCTAATCGGT
TCGCTTCAAGTAGATCTCACCAGTCATCGTAATGTAAAAAGCTCCCATGACACGTACGAT
ACTCAAGTACAACAGGGATTATTCCGCAAGATGGAAATCGATGTGACGGGCGATTGTGAA
ACTCTTTACACAGTATCTCCTGCTGCGTCTGAATGGAGACGTGAACTTCCCAGTTTTGCT
TCCGACGATGAACCAATTGAAATAACTAAAAGCAAGAACTACGGTCATTGTCATCATCGT
GTAGATTATCACTTCGGTGTACCGGAAGGAGCAGAATGGTCTGGCACTGCCCACAAAACT
GGAAAGGAACAATTCATAAATCGTGCTACGGTCTCAAGAATGCTGGTTGGGAAGAATGGT
CACATATACAAAGCTGAAACAACCAGTACGGTTACTGCTCATCCCCATTTATATGGGGAA
CAAAAGGCACAGGTACACGGCAAAGTGCGTTTTAATTTAATGTCGTATGAGGATGATAAT
GAACCGGCATGGGTATACCCCGAAGGTGCGCGTGAAGTTACCAACTTATTATATGCTTTG
ACGGCAAAACCAATTGATATCGGTGATAGTTCGTCGTCTGAAAAGTCTATAAAAATTGAG
AAACATCCGAGGCAACGTCGCTCCAGTCGTATGAAATCTTTCGTCTCCATAAATAAGAAG
ATTGTTACTGAAACACATGGATCTTCCAGTTCCAGTGAATCAGACTCAGTATATGTAAAT
GATGACATTCCCAATATCAACGAACCCGCCTATGCTTCTCTCTATATGAACCCAGATCTT
CATGGTGATAAGAAACAGAATCCCATGAATGCTCAGAAGCTTTTACAAGAAATCGCCCAA
CAATTGCAAAATCCGAACAATATGCCGAAAGCGGATACCTTATCCAAATTTAATATTCTA
GTTCGTGTCATCGCCAGTATGAGTTATGGACAGCTCGGTCTGACAAGCCGCAGCATTGAA
ATTGCTAAGTTGGCTAATGATGTCGTGAAGTCTAACATGTGGATGATCTACAGAGATGCT
GTCGCCCAAGCCGGTACTCTGCCCGCATTCCAACAGATAAAGGCTTGGATTGAAAGCAAA
AAATTAGAAGGAGAAGAGGCGGCGGAAGTTATTTCCGTGCTTGCAGTATCTCTAAGGTAT
CCCACGAAGGTGGTCATGAAACAATTCTTTGATCTCGCCATGAACCCCGAGGTAACTAAA
CAGATGTTCCTTAATGACACTGCACTAATCGCTGCTGCTAAATTAATAAACATGGGACAA
GTAAACAATGAAACTGTGCATCGTTACTATCCGACACATATGTACGGACGTCCATCACCT
AAGGAAGATGCCTTCGTGATTAATGAAATTCTTCCCCGTCTGAGTCAGGAGCTTCAACTG
GCTATTGAAAATGGGGATAGTCGAAAATCACAAGTATATATTAAGGCTATCGGCGAACTT
GGTCACCCAGCTATCCTGGATATATTTAAACCGTACCTTGAAGGCAAAATTCCGGCTTCA
ACTTATCTTAGAACCAGAATCATAGAACATCTCTATGTTCTGGCCAAAGGAAGGGATGAT
TATGTACGTGCTGTGTTATTTAGCGTTTTGAAAAATACTGCTGAACCATATGAAGTAAGA
GTAGCAGCCATCGATAAAATCTTTATGTCACGACCAAGTACAGCGATGATGATGGCAATG
GCACAAATGACTAAAGACGATCCTAGTATCCAAATCCGTGCAGCGCTTAAATCGGCAATT
ACATCTGCATCAGAACTTAAAAATCCAAGATTCCATGACCTGGCAAGAACAGCAGCAGCT
GTTAAGGATATGCTCACAAGTGAAGAGTTTGGTTTACAATACTCTGGTAAAAACTTCCTG
GAACACTACGACAGGGATGAGCAACCAAGTTCTATGTCAGTACTCTCAAGACTGGGAAGC
AAGGATAGTCTGCTTCCGAAATATTGGAGATATTCATGGAAAGGAAGAGACGGAGGTTGG
GATCAAGAAACAGTTATCTCAGGAGCTGCTTCAAGTTGGCAGGAACTATTTGATCTCTTC
GCAGATCAGATGTTTGGACAAAGAAAACCCGATCAATATCCCGAATACAATCCTAAATAC
TCCGCTGAAAAGATTGCTGACATGTTGAACGTAAAAAAAGACGACCGAGAATCATCAGAG
GGCTCATTTTATATAAATTTACTAAATCAGAGGAGATACTTTGCTTTCAATGAAAATGAT
GTTAAAGAATTAGGCATTAAATTTCGCGAGTACTTAACAAATCTCAAAGACGTTGCTAAG
CAATACACTAAAGTCGTTAACAGGAACCAAGTGTCAGTCATGTTCCCTATAGCTACAGGA
GTACCATTTATTTATAAATATAAGGAACCGGTTCTCCTACATGTTCGTACTGTAACTAAA
GGAAACGTTGATTTTAAGGATAGAGAGGAATATAGGTCTAGTGCTTCTATCAATAGCGAG
CTGCGGATAATTTACGCTGAAAATCATGATGGCAATGTTGGTTTTCTAGACACTCTTGGT
AATCAACTTGCAAGCGTTGGATTAGTGAGAAAAAGTCAACTTAATATTCCAATTAAAATA
GATCTTGAAATGAAATCTGGAGAAGCGAAGTTCCATTTAAGTCCAATGGAACCCGAACAA
GATAATACTATAGCTCATTACAGTGTTTGGCCATATTCCGCAAACCAAAAGAAGGACACT
TTAACACCTATTTCTCAGGATCCTATATCAAGAGTTATTATGAGACCCGAAAAAGTAGCC
CAGATTGATAGCAAGTTTGGACAAAACTTTGGATCCATATTCCAACTCCAGGGTTATTCT
TATTCTGAAGATTACAGGTACATAGGAGACATGCTGAAGTCCTACAATTATTTAACTAGT
ATTATCAGGATGTTCAAGCAAAAAGATATAGCTCAAACTCACTTTAATCTGAGGTACTTG
GGAAAGCAATCTAAGAACAAAGGAGTCACAATCACAGTAGCTTACGACACACTGTATAAT
CAGAAAGAAACAGGCGTTATGCCAATAACTGCATCGGATGTGAAGGACTCGACACCCAAC
AGTCCATCACGACGAGAGGAATTAATTAAACGTGTTATAGCTGGCATACAATCATCTAGA
GCCCACGTCGTTGATTTGAGCGCAAAATTCGAGACAGAACAAAAATTGGAGTATACTGCG
ACCCTTGCAATCGGCGCGAGTGTCGTCGATCAAAAAATTCAGTTTGCTTTATTTGCTGGT
AGAAACTCTGATCAATACGGATCAAATCAGTTAAATGCCGTAGGTAGAGTTACGAAACCA
TTGTCAGATTCCCCTATTAATTTCCAAAAAGCACTAGAAAAAGAACTGAAAATGGATTTT
GAGGCCGATATCCTTTACAACCAGAAAGAAAATATCCACATTCTTGGCTCTGCCGAAAGA
ACAAAGAGATATATAGAAGAACTTCAGAAAGAACCACAAGTAAAGAGATGTCTTGAAAAT
TATGCCAGAGGTAATTATTACCAACACGACTGTCATGAAGCGGTTGTTATGGCCCATGCT
CCAGACAACTTCAAATTCAGTGTAAGTTACAAAGACGTCAGCTCTGGGACTAAAAATGCT
GCAGCCTACGCTTACAGAATTTTAGACGGGCTTAATTTATGGAGATCGGATATTAATATG
GCAAAGACGTTACCTGCTGGAAAACTTGAATTGAACGTTGATGCTTTATACTGGACAAGA
AATTTAAATCTTATTGTAAATTCTCGTTTTGGGGAATTGCGGGTAAACAATATACCTATA
CCTGAAGTTACTTCTAGAGCTGTGTCTATGTACTTACCGATCAGCGCCTATGAGCGAATT
CTAAATTATTACACCTGGCATCAGTATCAACCATATTGCAGTGTGGACAGTAACAGGGTG
AGGACCTTCAGTAACCGTGAATATGATTACACGCTGTCACCTTCCTGGCACGTAGTGATG
CACGATGACAGACCCGGCAGAAACGAGGATTTAGTCGTGCTGTCCAGAAGACCTCAAGAA
ATGAAAATGCAAATATACTTATCTTACAGATCTTACACTGGCAAATACATAGAGATGGAA
GTTCAACCAGCCCCGGACACTCAACAGAAGCACTCTGTTCAAGTCAAGACCAATGCCAAA
AAAGTGTCTGAAGGAGAACTTACAACCTACTGGGACGACGTCAATGACAGTCCGTTACTT
GAATACTACAGTACTGGCGACAATGTCTTAATGATCAAATTGCGTGAGAATCGTCTCAGA
ATCGTGTATGACGGAGAAAGGAGCGTAGTTCTTTCGAGAGACAACCGCAAAAACATCAGG
GGAATTTGTGGAAGAATGAGCGGTGATCCTCGCGATGACTACCTAACACCTAGTGGTCTC
GTAGATAAACCAGAATACTATGGAGCTTCCTACGCTCTTATTGAAGACGAGAATGATCCC
AGAACACAAGAATTGCAATCGGAAGCTAAAAGAAAGGCGTACGAGCCAAGAAAACAATAC
ACCACAATCTTGCAATCTGATAACAAATGGCAAAATGCTATGCTCTCTTCGTCTGAAGAT
GATTGGGACTCTCAGATCGTATACAGGGCAAGGAACTATGGAAAGAGTAAGGGAAAATGT
AAAGTAGTCCCTCAAGTGCAGTATTATGAGAACCAATCACAGATCTGTATAACCACCAGT
TCCTTACCGTCCTGCCAGTCTTCCTGTAGCGGAGGCAGCTACAAGATTCAGTCGACACAA
GTTGTTTGCCGCTCCAAGCTGGACTCTCAATTCCAATCTTACAGAGATGAAATCAAACTA
GGCAAAAGTCCCAAAGTCAGCGGAGAGCCGCGAACTGTAGACTACAGAGTCCCTAGTTCT
TGCAAATCCTAA
Protein sequence:
MKLLVLAATIVVVSSGQLSDVVVESPWPWQVGKLYRYDVETHTLARYLDSFSSGNAFRAK
FTVRAKSDGYLQARLENPEYAKVYQKLEQHDPMPEDLKYVPVANLDKPFEIYIEGGRILS
VKLPFSVTLMQENLIKGLIGSLQVDLTSHRNVKSSHDTYDTQVQQGLFRKMEIDVTGDCE
TLYTVSPAASEWRRELPSFASDDEPIEITKSKNYGHCHHRVDYHFGVPEGAEWSGTAHKT
GKEQFINRATVSRMLVGKNGHIYKAETTSTVTAHPHLYGEQKAQVHGKVRFNLMSYEDDN
EPAWVYPEGAREVTNLLYALTAKPIDIGDSSSSEKSIKIEKHPRQRRSSRMKSFVSINKK
IVTETHGSSSSSESDSVYVNDDIPNINEPAYASLYMNPDLHGDKKQNPMNAQKLLQEIAQ
QLQNPNNMPKADTLSKFNILVRVIASMSYGQLGLTSRSIEIAKLANDVVKSNMWMIYRDA
VAQAGTLPAFQQIKAWIESKKLEGEEAAEVISVLAVSLRYPTKVVMKQFFDLAMNPEVTK
QMFLNDTALIAAAKLINMGQVNNETVHRYYPTHMYGRPSPKEDAFVINEILPRLSQELQL
AIENGDSRKSQVYIKAIGELGHPAILDIFKPYLEGKIPASTYLRTRIIEHLYVLAKGRDD
YVRAVLFSVLKNTAEPYEVRVAAIDKIFMSRPSTAMMMAMAQMTKDDPSIQIRAALKSAI
TSASELKNPRFHDLARTAAAVKDMLTSEEFGLQYSGKNFLEHYDRDEQPSSMSVLSRLGS
KDSLLPKYWRYSWKGRDGGWDQETVISGAASSWQELFDLFADQMFGQRKPDQYPEYNPKY
SAEKIADMLNVKKDDRESSEGSFYINLLNQRRYFAFNENDVKELGIKFREYLTNLKDVAK
QYTKVVNRNQVSVMFPIATGVPFIYKYKEPVLLHVRTVTKGNVDFKDREEYRSSASINSE
LRIIYAENHDGNVGFLDTLGNQLASVGLVRKSQLNIPIKIDLEMKSGEAKFHLSPMEPEQ
DNTIAHYSVWPYSANQKKDTLTPISQDPISRVIMRPEKVAQIDSKFGQNFGSIFQLQGYS
YSEDYRYIGDMLKSYNYLTSIIRMFKQKDIAQTHFNLRYLGKQSKNKGVTITVAYDTLYN
QKETGVMPITASDVKDSTPNSPSRREELIKRVIAGIQSSRAHVVDLSAKFETEQKLEYTA
TLAIGASVVDQKIQFALFAGRNSDQYGSNQLNAVGRVTKPLSDSPINFQKALEKELKMDF
EADILYNQKENIHILGSAERTKRYIEELQKEPQVKRCLENYARGNYYQHDCHEAVVMAHA
PDNFKFSVSYKDVSSGTKNAAAYAYRILDGLNLWRSDINMAKTLPAGKLELNVDALYWTR
NLNLIVNSRFGELRVNNIPIPEVTSRAVSMYLPISAYERILNYYTWHQYQPYCSVDSNRV
RTFSNREYDYTLSPSWHVVMHDDRPGRNEDLVVLSRRPQEMKMQIYLSYRSYTGKYIEME
VQPAPDTQQKHSVQVKTNAKKVSEGELTTYWDDVNDSPLLEYYSTGDNVLMIKLRENRLR
IVYDGERSVVLSRDNRKNIRGICGRMSGDPRDDYLTPSGLVDKPEYYGASYALIEDENDP
RTQELQSEAKRKAYEPRKQYTTILQSDNKWQNAMLSSSEDDWDSQIVYRARNYGKSKGKC
KVVPQVQYYENQSQICITTSSLPSCQSSCSGGSYKIQSTQVVCRSKLDSQFQSYRDEIKL
GKSPKVSGEPRTVDYRVPSSCKS