Monarch geneset OGS2.0

DPOGS216049
Transcript: DPOGS216049-TA, 4224 bp
Protein: DPOGS216049-PA, 1407 aa
Genomic position: DPSCF300067 - 151309-164156
RNAseq coverage: 11240x (Rank: top 1%)
Annotation
Heliconius: HMEL015049, E-value 0.0, 51.37%
Bombyx: BGIBMGA009028-TA, E-value 0.0, 54.26%
Drosophila: Lsp2-PA, E-value 8e-93, 33.76%
EBI UniRef50: UniRef50_G6DKT4, E-value 0.0, 100.00%, Arylphorin-type storage protein n=2 Tax=Obtectomera RepID=G6DKT4_DANPL
NCBI RefSeq: NP_001037590.1, E-value 0.0, 54.26%, sex-specific storage-protein 2 [Bombyx mori]
NCBI nr blastp: gi|194400543, E-value 0.0, 64.20%, arylphorin-type storage protein [Pieris rapae]
NCBI nr blastx: gi|194400543, E-value 0.0, 64.45%, arylphorin-type storage protein [Pieris rapae]
Group
Gene Ontology: GO:0006810, E-value 6.4e-137, transport
               GO:0005344, E-value 6.4e-137, oxygen transporter activity
KEGG pathway: der:Dere_GG22822, E-value 1e-51
    K00505 (E1.14.18.1) maps to:
        Riboflavin metabolism
        Betalain biosynthesis
        Isoquinoline alkaloid biosynthesis
        Tyrosine metabolism
        Melanogenesis
InterPro domain: [796-1405] IPR013788, E-value 6.4e-137, Arthropod hemocyanin/insect LSP
                 [867-1143] IPR000896, E-value 4.7e-85, Hemocyanin, copper-type
                 [867-1149] IPR008922, E-value 2.9e-81, Uncharacterised domain, di-copper centre
                 [1150-1385] IPR014756, E-value 3.5e-74, Immunoglobulin E-set
                 [1151-1384] IPR005203, E-value 1.2e-71, Hemocyanin, C-terminal
                 [738-866] IPR005204, E-value 4.6e-47, Hemocyanin, N-terminal
Orthology group: MCL10073, Insect specific

Nucleotide sequence:

>DPOGS216049-TA
ATGGCGCGTGTCGTACTAGGTGTACTCGCCATCCTGGTGGCTCAGGCCCTGGGAGAAGTCATCCCTAAAAGCGTTGTACCACCAAAAGTTGTTGACAATGTCATTTTACAACGCCAGATTGAACTATCCAATTTGTTCAACCGCATTAGTGATCCAGTCCAAAGCTTGGAATTAAAGTCGATAGTTGAATCATTTGACTGGGAGAAGAATATTGAACATTACAGCAATGTGACTGCTGTTAAAGAATTCATTTACTTGCTCGAACACGATATTCTCCAGCCACGTTGGTTTTCCTTCTCTCTAAGTGAGCCCGCAGTACGTCGTGAAGCTGAAATTGCCTTCAATTTGTTATACAGTGCCAAGGACTATTCATCATTCTATAAGGCAGCTGTTTACTTAAGGCAACATCTGAATGAAGCAATTTTTGTTTACGTATTGTCTGTCGCTATTCTTTACCACCCCGAGACCCAGGGTATCGTTGTACCACCAGTATATGAAATTTTCCCATCTTACTTCCACAATGCAGAAATCATTAACTTGGCCAATAAGATTAATGTACATGGAAAGGAATCTGTAAAGAACTACCCGCAAAGTTACATGTGGGATGAGAACGTAGTGATCAGGTGGAACGAAACTGTTTGGCCATATGCCGGACAGGAAAGCGCACCAATGTCATACTTTATGAATGATTACTCACTGAACGCTATCATTTACAACAATCATCTTAGACAACCATTTTGGCTTGACGAGTCTATAGTTTCCCAACATAAACTAAATTGGGGAGCCTACAATCTTTTCTTCTATAAACAAATTGTCAGCCGTTATTACTTGGAAAGACTTTCCAATGGTCTCGGAGAAATTCCTCTTTTGAACTGGGATGTTGTAGAGGAAGGTTATTCAAGCGGTTTGGTCCATTACAACGGTGTTCCCTTGCCAATCAGGCCTGATTACTACCAACTTAACCAACCTAAGCTTCTTGAATCTGTAGAACAGCTGAAGATCTATGAGCGTAGAATTCGTGAAGCCATTCAACTAGGATATGTTATTGGTGAAAACGGTGACAAGATTGATTTGCGCAGATCAGAAGCCGTGGACGTAATTTGGAATATTATCGAAGGAAACGACTATTCTCCCAATTTGGGTTACTATGGTAGCATTTTGAGCTCATGGAAAAAACTACTTGGCAATGCTATTGTATCTCGTAAAATGTGGTGGAAAGGATACGTGCCTCTTGTTATGTCCTCTGTTTTGGAAATTCCATGTGCAGAAGCTCGTGATCCCGCTGTTTATATGATTTGGAAACGTATCGTTAACTTGTACGACCTGTGGGTCTCATATTTGCCAAAATACAGAGTAGAGGATTTGGCTGTACCTGATGTTCAAATCCAAAAAGTTGAGGTTGATAAACTTGTAACTTACTTTGAACAGAGCTACGTCAACATCTCTAACGTCTTACCATACAATGTGGTTGAGTCTAAGGTAAGCCCAGTGAGCGTTTTGGTCCAACGGCCGCAGTTAAACAACAAAGTATTTAAAGTACGCGTTAATGTCAAGAGTGATGTTTCTAAAAAAGTGGTAGTCAAGTTCTTTGTGGGTCCCAAATACGACAGCAAGGGCTTGGAAATTCCATTGCAAGAAAATTCACAGAACTTCTTCCAAATTGATCAATTCATTTACGAATTGCCAGCAGGAGAATGTGTGATTAAACGTGAGTCCAGTAGCAACTCATACATGATCGACCAATGGTTGTCTAATTCTGAAATTGTAAGCAAGGTTAGCAATGTTCTTCGCGGTAACGGTCAGTGGGTAGTTGATGTAAACAACTTCTACAGTGGATTCCCTCGTCATCTGATGTTACCTAAGGGTCGTATCGGTGGTATGCCGTTCCAATTCTTAGTCTTCATCAGTGACTACAAACCATATAATGGATTTTCTGGAAGCTGGAGCGGTGCTAGTCCTATGCGCGCTGTATACGAGCCCTACGGTTACCCGTTGAACAG
ACCGCTGAATGACATGTGGATACACAAACTTCCTAACTTACACATTAAGGAAGTTCAAATCTATCACAAGCCCACGCCTGAAATTGTTGCACAGTTGTCTTCTGGTCTTGTCAACATGAAGACTGTCTTGGCATTTGCTTGCCTGGCCCTAGCCTTAGCGGGCGCTGTGGTGGTTCCAAATAAACCCGTCTACAAAATTAAGTCTGTGGACAATGATTTTGTTGTAAAACAGAAGAGAATTTTCAACCTCTTCATCCACCCTGAGCAAGTAGATCCTGAAGCAGAATACTATCATGTCGGCAAGGACTACGATATTGAGGCCCACATTGATGACTATTCCAACAAGAAGGTTGTCCAAGAATTCTTAGACTTATGGAAGTCTGGATTTTTACCAAAGAACGTTCCTTTCTCTGTCTTCTATGAAAGACAAAGGGAAGAAGTAGTTGCCCTGTTCAATATTTTGTATAGTGCTAAAGATTTTGAAATCTTCTACAAGACCGCTGCCTTTGCGCGTGTCCACATTAACGAAGGACAATTCCTGTATGCATACTACATTGCTTTGATCCACCGTGCCGACACTAAGGGCTTTGTTGTACCTGCTCCTTACGAAATTTACCCCGAACTCTTCACCAACTCAAATGTTTGGTACAAGATCTTCCGTATTAAGATGCAGAATGGTATCTTCTCACCCGACTTCGGATCTGAGGACGGAATTGTCCACGAAGGAGATCGCTACGTGGTATACTCTAACTATTCCGACTACCTAACGTACCATAACGATGAACACAGGATTTCATACTTTACTGAAGATGTCGGTTTTAACGCTTTCTACTACTACTTCCAATCCTACTTCCCCTTCTGGATGGACGGTGACTTTTTCCCAGTAATAAAGGACCGTCGTGGAGAAATCTACTACTACGTCCACCAACAGCTGTTGGCTCGTTACTACCTGGAACGTCTTTCAAATGGATTAGGTGAAATTCCTGATTTCTCTTGGTGGCAGCCTATTAGGAGCGGTTACAGCCCCTACGTAAACTACTTCCACTCCTTTGTTCAAAGACCCTCGTACTACCAAATCCCCTATGAAAAGAATGAAGAACTTCAACTTTTGGATACCTATGAGAAGACTTTCATTCAATATCTTGAACAAGGTCATATCAATTCTGTTAATCAGGAGGTCGATCTTCACAACTCTAAGTCAATCAACTTCGTGGGCAACTTCTGGCAAGCTAATGCTGATATGTGGGGTAAAGGTGGACGCAAGGACAACCACAACTCCTTCGAAGTTACAGCTCGTCGTATTCTTGGTGCTGCTCCTGAACCCGTGGACAAATATAACTTTGTGCCAAGCGCTCTGGACTTCTACCAAACTTCTCTTCGTGACCCCATCTTCTACCAATTGTACAGCAAGATTCTTAAATACATCGTTGAGTACAAGAAGTTCCTGGCTCCTTACAACCAGGATAACTTACACTACGTTGGAGTTAAAATCAATGACGTTAAGGTAGACAAATTAGTTACATACTTTGACTACTTCGACTACGATGTATCCAATAACGTATTCTATAACAAAGAAGAGCTCAAGTCGCAACAGTATCCTTGGTACGTAGTACGTCAACCTCGTCTGAACCACAAGCCATTCAATGTAAATATCGATGTTAAGTCTGACGTTGAAGGTGAAGCTGTGTTCAAAATCTTTATTGGACCTAAATACAACAGCAAGGGTTATCCTATTTCCCTCGAAGACAACTGGCAAAACTTTGTTGAATGGGACTGGTTCGTACACAAGCTCAACAAGGGACAGAACAAGATTCAGCGCCAATCTAGTGATTTCTTCTACTACAAAGATGATTCCGTCCCTGTCCGTGATGTTCTTAAACTTCTTGAAGAATCTAAAATCCCTGCTGATATGGCCAACGAGTATGGTTCCTTCCCCAAACGGTTACTTGTTCCTAAAGGATCTCTTGGTGGTTTCCCCTACCAGATATTTGTGATGGTCT
ACCCGTACTCTCCAGTTGATAAGAAATTCGAAGGTTACAAGAGTTTTGCTGCGGATAACAAGCCTTATGGTTATCCATTCGACCGCCCAGTTCGTGAATCTTACTTTAAGCAACCTAACATGTTCTGGGAGGATGTTGTGGTTTACCATGAAGGAGAAGAGATGGCCTACAAATACAACATTCCCTACTATTCAATTCATCACAATGAAGTTGTCAAACACTAA

Protein sequence:

>DPOGS216049-PA
MARVVLGVLAILVAQALGEVIPKSVVPPKVVDNVILQRQIELSNLFNRISDPVQSLELKSIVESFDWEKNIEHYSNVTAVKEFIYLLEHDILQPRWFSFSLSEPAVRREAEIAFNLLYSAKDYSSFYKAAVYLRQHLNEAIFVYVLSVAILYHPETQGIVVPPVYEIFPSYFHNAEIINLANKINVHGKESVKNYPQSYMWDENVVIRWNETVWPYAGQESAPMSYFMNDYSLNAIIYNNHLRQPFWLDESIVSQHKLNWGAYNLFFYKQIVSRYYLERLSNGLGEIPLLNWDVVEEGYSSGLVHYNGVPLPIRPDYYQLNQPKLLESVEQLKIYERRIREAIQLGYVIGENGDKIDLRRSEAVDVIWNIIEGNDYSPNLGYYGSILSSWKKLLGNAIVSRKMWWKGYVPLVMSSVLEIPCAEARDPAVYMIWKRIVNLYDLWVSYLPKYRVEDLAVPDVQIQKVEVDKLVTYFEQSYVNISNVLPYNVVESKVSPVSVLVQRPQLNNKVFKVRVNVKSDVSKKVVVKFFVGPKYDSKGLEIPLQENSQNFFQIDQFIYELPAGECVIKRESSSNSYMIDQWLSNSEIVSKVSNVLRGNGQWVVDVNNFYSGFPRHLMLPKGRIGGMPFQFLVFISDYKPYNGFSGSWSGASPMRAVYEPYGYPLNRPLNDMWIHKLPNLHIKEVQIYHKPTPEIVAQLSSGLVNMKTVLAFACLALALAGAVVVPNKPVYKIKSVDNDFVVKQKRIFNLFIHPEQVDPEAEYYHVGKDYDIEAHIDDYSNKKVVQEFLDLWKSGFLPKNVPFSVFYERQREEVVALFNILYSAKDFEIFYKTAAFARVHINEGQFLYAYYIALIHRADTKGFVVPAPYEIYPELFTNSNVWYKIFRIKMQNGIFSPDFGSEDGIVHEGDRYVVYSNYSDYLTYHNDEHRISYFTEDVGFNAFYYYFQSYFPFWMDGDFFPVIKDRRGEIYYYVHQQLLARYYLERLSNGLGEIPDFSWWQPIRSGYSPYVNYFHSFVQRPSYYQIPYEKNEELQLLDTYEKTFIQYLEQGHINSVNQEVDLHNSKSINFVGNFWQANADMWGKGGRKDNHNSFEVTARRILGAAPEPVDKYNFVPSALDFYQTSLRDPIFYQLYSKILKYIVEYKKFLAPYNQDNLHYVGVKINDVKVDKLVTYFDYFDYDVSNNVFYNKEELKSQQYPWYVVRQPRLNHKPFNVNIDVKSDVEGEAVFKIFIGPKYNSKGYPISLEDNWQNFVEWDWFVHKLNKGQNKIQRQSSDFFYYKDDSVPVRDVLKLLEESKIPADMANEYGSFPKRLLVPKGSLGGFPYQIFVMVYPYSPVDKKFEGYKSFAADNKPYGYPFDRPVRESYFKQPNMFWEDVVVYHEGEEMAYKYNIPYYSIHHNEVVKH-