New model in OGS2.0 | DPOGS216049  |
---|---|
Genomic Position | scaffold665:+ 56696-69543 |
See gene structure | |
CDS Length | 4224 |
Paired RNAseq reads   | 56965 |
Single RNAseq reads   | 165876 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA009028 (2e-63) |
Best Drosophila hit   | larval serum protein 2 (5e-83) |
Best Human hit | ND |
Best NR hit (blastp)   | arylphorin-type storage protein [Pieris rapae] (0.0) |
Best NR hit (blastx)   | arylphorin-type storage protein [Pieris rapae] (0.0) |
GeneOntology terms    | GO:0005616 larval serum protein complex GO:0045735 nutrient reservoir activity GO:0005576 extracellular region GO:0005344 oxygen transporter activity GO:0006810 transport GO:0005811 lipid particle |
InterPro families    | IPR013788 Arthropod hemocyanin/insect LSP IPR000896 Hemocyanin, copper-containing IPR008922 Di-copper centre-containing IPR014756 Immunoglobulin E-set IPR005204 Hemocyanin, N-terminal IPR005203 Hemocyanin, C-terminal |
Orthology group | MCL10104 |
Nucleotide sequence:
ATGGCGCGTGTCGTACTAGGTGTACTCGCCATCCTGGTGGCTCAGGCCCTGGGAGAAGTC
ATCCCTAAAAGCGTTGTACCACCAAAAGTTGTTGACAATGTCATTTTACAACGCCAGATT
GAACTATCCAATTTGTTCAACCGCATTAGTGATCCAGTCCAAAGCTTGGAATTAAAGTCG
ATAGTTGAATCATTTGACTGGGAGAAGAATATTGAACATTACAGCAATGTGACTGCTGTT
AAAGAATTCATTTACTTGCTCGAACACGATATTCTCCAGCCACGTTGGTTTTCCTTCTCT
CTAAGTGAGCCCGCAGTACGTCGTGAAGCTGAAATTGCCTTCAATTTGTTATACAGTGCC
AAGGACTATTCATCATTCTATAAGGCAGCTGTTTACTTAAGGCAACATCTGAATGAAGCA
ATTTTTGTTTACGTATTGTCTGTCGCTATTCTTTACCACCCCGAGACCCAGGGTATCGTT
GTACCACCAGTATATGAAATTTTCCCATCTTACTTCCACAATGCAGAAATCATTAACTTG
GCCAATAAGATTAATGTACATGGAAAGGAATCTGTAAAGAACTACCCGCAAAGTTACATG
TGGGATGAGAACGTAGTGATCAGGTGGAACGAAACTGTTTGGCCATATGCCGGACAGGAA
AGCGCACCAATGTCATACTTTATGAATGATTACTCACTGAACGCTATCATTTACAACAAT
CATCTTAGACAACCATTTTGGCTTGACGAGTCTATAGTTTCCCAACATAAACTAAATTGG
GGAGCCTACAATCTTTTCTTCTATAAACAAATTGTCAGCCGTTATTACTTGGAAAGACTT
TCCAATGGTCTCGGAGAAATTCCTCTTTTGAACTGGGATGTTGTAGAGGAAGGTTATTCA
AGCGGTTTGGTCCATTACAACGGTGTTCCCTTGCCAATCAGGCCTGATTACTACCAACTT
AACCAACCTAAGCTTCTTGAATCTGTAGAACAGCTGAAGATCTATGAGCGTAGAATTCGT
GAAGCCATTCAACTAGGATATGTTATTGGTGAAAACGGTGACAAGATTGATTTGCGCAGA
TCAGAAGCCGTGGACGTAATTTGGAATATTATCGAAGGAAACGACTATTCTCCCAATTTG
GGTTACTATGGTAGCATTTTGAGCTCATGGAAAAAACTACTTGGCAATGCTATTGTATCT
CGTAAAATGTGGTGGAAAGGATACGTGCCTCTTGTTATGTCCTCTGTTTTGGAAATTCCA
TGTGCAGAAGCTCGTGATCCCGCTGTTTATATGATTTGGAAACGTATCGTTAACTTGTAC
GACCTGTGGGTCTCATATTTGCCAAAATACAGAGTAGAGGATTTGGCTGTACCTGATGTT
CAAATCCAAAAAGTTGAGGTTGATAAACTTGTAACTTACTTTGAACAGAGCTACGTCAAC
ATCTCTAACGTCTTACCATACAATGTGGTTGAGTCTAAGGTAAGCCCAGTGAGCGTTTTG
GTCCAACGGCCGCAGTTAAACAACAAAGTATTTAAAGTACGCGTTAATGTCAAGAGTGAT
GTTTCTAAAAAAGTGGTAGTCAAGTTCTTTGTGGGTCCCAAATACGACAGCAAGGGCTTG
GAAATTCCATTGCAAGAAAATTCACAGAACTTCTTCCAAATTGATCAATTCATTTACGAA
TTGCCAGCAGGAGAATGTGTGATTAAACGTGAGTCCAGTAGCAACTCATACATGATCGAC
CAATGGTTGTCTAATTCTGAAATTGTAAGCAAGGTTAGCAATGTTCTTCGCGGTAACGGT
CAGTGGGTAGTTGATGTAAACAACTTCTACAGTGGATTCCCTCGTCATCTGATGTTACCT
AAGGGTCGTATCGGTGGTATGCCGTTCCAATTCTTAGTCTTCATCAGTGACTACAAACCA
TATAATGGATTTTCTGGAAGCTGGAGCGGTGCTAGTCCTATGCGCGCTGTATACGAGCCC
TACGGTTACCCGTTGAACAGACCGCTGAATGACATGTGGATACACAAACTTCCTAACTTA
CACATTAAGGAAGTTCAAATCTATCACAAGCCCACGCCTGAAATTGTTGCACAGTTGTCT
TCTGGTCTTGTCAACATGAAGACTGTCTTGGCATTTGCTTGCCTGGCCCTAGCCTTAGCG
GGCGCTGTGGTGGTTCCAAATAAACCCGTCTACAAAATTAAGTCTGTGGACAATGATTTT
GTTGTAAAACAGAAGAGAATTTTCAACCTCTTCATCCACCCTGAGCAAGTAGATCCTGAA
GCAGAATACTATCATGTCGGCAAGGACTACGATATTGAGGCCCACATTGATGACTATTCC
AACAAGAAGGTTGTCCAAGAATTCTTAGACTTATGGAAGTCTGGATTTTTACCAAAGAAC
GTTCCTTTCTCTGTCTTCTATGAAAGACAAAGGGAAGAAGTAGTTGCCCTGTTCAATATT
TTGTATAGTGCTAAAGATTTTGAAATCTTCTACAAGACCGCTGCCTTTGCGCGTGTCCAC
ATTAACGAAGGACAATTCCTGTATGCATACTACATTGCTTTGATCCACCGTGCCGACACT
AAGGGCTTTGTTGTACCTGCTCCTTACGAAATTTACCCCGAACTCTTCACCAACTCAAAT
GTTTGGTACAAGATCTTCCGTATTAAGATGCAGAATGGTATCTTCTCACCCGACTTCGGA
TCTGAGGACGGAATTGTCCACGAAGGAGATCGCTACGTGGTATACTCTAACTATTCCGAC
TACCTAACGTACCATAACGATGAACACAGGATTTCATACTTTACTGAAGATGTCGGTTTT
AACGCTTTCTACTACTACTTCCAATCCTACTTCCCCTTCTGGATGGACGGTGACTTTTTC
CCAGTAATAAAGGACCGTCGTGGAGAAATCTACTACTACGTCCACCAACAGCTGTTGGCT
CGTTACTACCTGGAACGTCTTTCAAATGGATTAGGTGAAATTCCTGATTTCTCTTGGTGG
CAGCCTATTAGGAGCGGTTACAGCCCCTACGTAAACTACTTCCACTCCTTTGTTCAAAGA
CCCTCGTACTACCAAATCCCCTATGAAAAGAATGAAGAACTTCAACTTTTGGATACCTAT
GAGAAGACTTTCATTCAATATCTTGAACAAGGTCATATCAATTCTGTTAATCAGGAGGTC
GATCTTCACAACTCTAAGTCAATCAACTTCGTGGGCAACTTCTGGCAAGCTAATGCTGAT
ATGTGGGGTAAAGGTGGACGCAAGGACAACCACAACTCCTTCGAAGTTACAGCTCGTCGT
ATTCTTGGTGCTGCTCCTGAACCCGTGGACAAATATAACTTTGTGCCAAGCGCTCTGGAC
TTCTACCAAACTTCTCTTCGTGACCCCATCTTCTACCAATTGTACAGCAAGATTCTTAAA
TACATCGTTGAGTACAAGAAGTTCCTGGCTCCTTACAACCAGGATAACTTACACTACGTT
GGAGTTAAAATCAATGACGTTAAGGTAGACAAATTAGTTACATACTTTGACTACTTCGAC
TACGATGTATCCAATAACGTATTCTATAACAAAGAAGAGCTCAAGTCGCAACAGTATCCT
TGGTACGTAGTACGTCAACCTCGTCTGAACCACAAGCCATTCAATGTAAATATCGATGTT
AAGTCTGACGTTGAAGGTGAAGCTGTGTTCAAAATCTTTATTGGACCTAAATACAACAGC
AAGGGTTATCCTATTTCCCTCGAAGACAACTGGCAAAACTTTGTTGAATGGGACTGGTTC
GTACACAAGCTCAACAAGGGACAGAACAAGATTCAGCGCCAATCTAGTGATTTCTTCTAC
TACAAAGATGATTCCGTCCCTGTCCGTGATGTTCTTAAACTTCTTGAAGAATCTAAAATC
CCTGCTGATATGGCCAACGAGTATGGTTCCTTCCCCAAACGGTTACTTGTTCCTAAAGGA
TCTCTTGGTGGTTTCCCCTACCAGATATTTGTGATGGTCTACCCGTACTCTCCAGTTGAT
AAGAAATTCGAAGGTTACAAGAGTTTTGCTGCGGATAACAAGCCTTATGGTTATCCATTC
GACCGCCCAGTTCGTGAATCTTACTTTAAGCAACCTAACATGTTCTGGGAGGATGTTGTG
GTTTACCATGAAGGAGAAGAGATGGCCTACAAATACAACATTCCCTACTATTCAATTCAT
CACAATGAAGTTGTCAAACACTAA
Protein sequence:
MARVVLGVLAILVAQALGEVIPKSVVPPKVVDNVILQRQIELSNLFNRISDPVQSLELKS
IVESFDWEKNIEHYSNVTAVKEFIYLLEHDILQPRWFSFSLSEPAVRREAEIAFNLLYSA
KDYSSFYKAAVYLRQHLNEAIFVYVLSVAILYHPETQGIVVPPVYEIFPSYFHNAEIINL
ANKINVHGKESVKNYPQSYMWDENVVIRWNETVWPYAGQESAPMSYFMNDYSLNAIIYNN
HLRQPFWLDESIVSQHKLNWGAYNLFFYKQIVSRYYLERLSNGLGEIPLLNWDVVEEGYS
SGLVHYNGVPLPIRPDYYQLNQPKLLESVEQLKIYERRIREAIQLGYVIGENGDKIDLRR
SEAVDVIWNIIEGNDYSPNLGYYGSILSSWKKLLGNAIVSRKMWWKGYVPLVMSSVLEIP
CAEARDPAVYMIWKRIVNLYDLWVSYLPKYRVEDLAVPDVQIQKVEVDKLVTYFEQSYVN
ISNVLPYNVVESKVSPVSVLVQRPQLNNKVFKVRVNVKSDVSKKVVVKFFVGPKYDSKGL
EIPLQENSQNFFQIDQFIYELPAGECVIKRESSSNSYMIDQWLSNSEIVSKVSNVLRGNG
QWVVDVNNFYSGFPRHLMLPKGRIGGMPFQFLVFISDYKPYNGFSGSWSGASPMRAVYEP
YGYPLNRPLNDMWIHKLPNLHIKEVQIYHKPTPEIVAQLSSGLVNMKTVLAFACLALALA
GAVVVPNKPVYKIKSVDNDFVVKQKRIFNLFIHPEQVDPEAEYYHVGKDYDIEAHIDDYS
NKKVVQEFLDLWKSGFLPKNVPFSVFYERQREEVVALFNILYSAKDFEIFYKTAAFARVH
INEGQFLYAYYIALIHRADTKGFVVPAPYEIYPELFTNSNVWYKIFRIKMQNGIFSPDFG
SEDGIVHEGDRYVVYSNYSDYLTYHNDEHRISYFTEDVGFNAFYYYFQSYFPFWMDGDFF
PVIKDRRGEIYYYVHQQLLARYYLERLSNGLGEIPDFSWWQPIRSGYSPYVNYFHSFVQR
PSYYQIPYEKNEELQLLDTYEKTFIQYLEQGHINSVNQEVDLHNSKSINFVGNFWQANAD
MWGKGGRKDNHNSFEVTARRILGAAPEPVDKYNFVPSALDFYQTSLRDPIFYQLYSKILK
YIVEYKKFLAPYNQDNLHYVGVKINDVKVDKLVTYFDYFDYDVSNNVFYNKEELKSQQYP
WYVVRQPRLNHKPFNVNIDVKSDVEGEAVFKIFIGPKYNSKGYPISLEDNWQNFVEWDWF
VHKLNKGQNKIQRQSSDFFYYKDDSVPVRDVLKLLEESKIPADMANEYGSFPKRLLVPKG
SLGGFPYQIFVMVYPYSPVDKKFEGYKSFAADNKPYGYPFDRPVRESYFKQPNMFWEDVV
VYHEGEEMAYKYNIPYYSIHHNEVVKH