Monarch geneset OGS2.0

DPOGS209980
TranscriptDPOGS209980-TA2247 bp
ProteinDPOGS209980-PA748 aa
Genomic positionDPSCF300148 + 233308-235953
RNAseq coverage53842x (Rank: top 0%)
Annotation
HeliconiusHMEL0096810.075.48% 
BombyxBGIBMGA011266-TA0.066.48% 
DrosophilaLsp1beta-PA1e-8430.04% 
EBI UniRef50UniRef50_Q063420.068.27%Basic juvenile hormone-suppressible protein 1 n=33 Tax=Ditrysia RepID=BJSB1_TRINI
NCBI RefSeqNP_001106747.20.066.34%sex-specific storage-protein 1 precursor [Bombyx mori]
NCBI nr blastpgi|1995812910.074.62%methionine-rich storage protein [Heliconius erato]
NCBI nr blastxgi|1995812910.074.10%methionine-rich storage protein [Heliconius erato]
Group
Gene OntologyGO:00068104.6e-121transport
GO:00053444.6e-121oxygen transporter activity
KEGG pathwaydme:Dmel_CG426407e-47 
 K00505 (E1.14.18.1)maps-> Riboflavin metabolism
    Betalain biosynthesis
    Isoquinoline alkaloid biosynthesis
    Tyrosine metabolism
    Melanogenesis
InterPro domain[81-719] IPR0137884.6e-121Arthropod hemocyanin/insect LSP
[162-437] IPR0008963e-96Hemocyanin, copper-type
[162-445] IPR0089226.3e-85Uncharacterised domain, di-copper centre
[446-698] IPR0052035.1e-82Hemocyanin, C-terminal
[446-699] IPR0147561.7e-80Immunoglobulin E-set
[33-161] IPR0052044.4e-43Hemocyanin, N-terminal
Orthology groupMCL25829 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209980-TA
ATGAGGTTCCTGGTGTTGGCGGCGATCATAGCCGTGGCTTCGGCTACTGTCATCAAGGACAGTTCCTTTGTCATTGGCAAGGATACTCTAGTAAATGTAGACATAAAAACCAAGGAGGTGTTATGTATGAAACTACTGAACTACATCCTGCAGCCGACGGTGTACGACGACATCCGAGATGTGGCCAGAGACTGGGTGTTAGAGGAGAACATTGACAAATACCTGAAAGTCGATGTCGTGAGGAGGTTCATCGAAATGTACAAGATGGGCTTCCTGCCCCGTGGTGAGGTTTTCGTACACACCAACGAGAAGCAGATGGACCAGGCCATCCAGACCTTCCGTCTTCTGTACTTCGCTAAGGACTTCGACACCTTCATAAGAACCGCCTGCTTCCTCCGCGAACGCATCAACGGAGGCATGTTCGTGTATGCCTTCACTTGCGCTGTCTTCCATCGCGAAGACTGTCGTGGAGTCTCCATTCCCGCTCCTTACGAGATCTACCCTTACTTCTTCGTGGACAGTCACATCATCAACAAGGCTTTCATGATGAAAATGACAAAAGCCATCACCGACCCCTTAGTTATGGATTACTATGGCATCAAAGTCACCGACAAGAACCTAGTTATCATCGACTGGCGCAAGAGCGTTCGTCATGTTCTTAGTGAAAATGACCGCTTGTCATACTTTACCGAGGACATCGATTTAAACACCTACTACTACTACTTGCATATGAATTATCCTTACTGGATGACTGATGACGTATACGGTCTTAACAAGGAGCGTCGGGGAGAGATCGCTATGTACGCCAACCAGCAACTGCTCGCCAGGTACAGGTTGGAGCGTCTGTCTCACAAAATGTGCGACGTCAAGATGATCATGTGGAACGAACCTCTGAAGAGCGGTTACTGGCCCAAGATCCGTATGCACACCGGTGATGAGATGCCCGTCCGCAGTAACTACGTTGAACTGGTTCACAAAAACAACCTTAAGGACAAAATGTACGTCGATGACGTCGAAAAACTTATCCGCATGGCTATTGTCACCGGAAAGTTTGAAATGCGCGACGGTACAGTACTTAATCTCCGGAAATCTGAGGACTTCGAAATCTTGGCCAGGATCTTGCTTGGTGGTATGGGTTTAAAGAATGACGATGCTAAAGTCATCCACATTGTAAACTTGTTCAAGAGACTGCTCTCTTACAGCAGTTACAACTTCGACAAGAACACCTACATCCCAACCGCTCTAGACATGTACACCACCTGCCTGCGCGACCCTGTTTTCTGGAGAATGATGAAGCGCATCACCGATTACGGCGTTCTCTTCAAGAAGTTCTTGCCTAAATACACCAAAGATGAATTAGACTTCCCTGGAGTTAAGATCGACCGCATTGTTACTGACAAATTGGTGACTTTCATGGATGAATATGACATGGATATAACCAACGCCATGTATCTTGACAAAACTGAGATCCAAAAGAAGAAGTCCGATATGGTATACGTTGCTCGTATGCGCCGTCTCAACCACCATTCCTTCAAGGTTTCAGTCGATGTGACCTCTGAGAAGGCTGTCGATGCTGTAGTCAGAATCTTTATTGGACCTAAATGGGACTGTATGGGACGCCTGCTCAGCTACAACGACAAACGTTTGGATATGGTTGAAATCGATAGTTTCTTATACAAACTGGAGACTGGAAAGAACACCATCGTTCGCAGCTCGATGGAGATGCACGGCGTCATTGGCGACAGGATGATGACTCGTCGCATGATGGACAACACCGTGGACACCACTGGCTCAATGGAAAGGATAGTAGACAGCTTTTGGTACAAGAGCCGCCTCGGTTTCCCCCACCGTCTTCTGCTTCCTCTGGGTCACCGTGATGGTCTCGAGCTGCAGATGTTCGTGATTGTGACCCCAGTCCGCACCGGCCTCGTCCTGCCGTCCATCGACATGGGCGTTATGAAGGATCGTCGCGCTTGCCGCTGGAGCGTCTGCTTCGACACCATGCCTCTCGGATTCCCATTCGACAGAGAGATCGACATGAGCCACTTCGTCACCAACAACATGAAATTCCACGACATACTCGTGTTCAGGAAAGACTTGGACTTGTCCAACTCCGTCAAAGACATCGACACCTCTGACATGGTGATGATGCGCGACGACCTCACCTACCTGGACCGCGACATGCTCGTCAGGTGGTCCTACAAGGACGTCATGATGATGAGCACTGACAAGATGATGCGTCTTTAA

Protein sequence:

>DPOGS209980-PA
MRFLVLAAIIAVASATVIKDSSFVIGKDTLVNVDIKTKEVLCMKLLNYILQPTVYDDIRDVARDWVLEENIDKYLKVDVVRRFIEMYKMGFLPRGEVFVHTNEKQMDQAIQTFRLLYFAKDFDTFIRTACFLRERINGGMFVYAFTCAVFHREDCRGVSIPAPYEIYPYFFVDSHIINKAFMMKMTKAITDPLVMDYYGIKVTDKNLVIIDWRKSVRHVLSENDRLSYFTEDIDLNTYYYYLHMNYPYWMTDDVYGLNKERRGEIAMYANQQLLARYRLERLSHKMCDVKMIMWNEPLKSGYWPKIRMHTGDEMPVRSNYVELVHKNNLKDKMYVDDVEKLIRMAIVTGKFEMRDGTVLNLRKSEDFEILARILLGGMGLKNDDAKVIHIVNLFKRLLSYSSYNFDKNTYIPTALDMYTTCLRDPVFWRMMKRITDYGVLFKKFLPKYTKDELDFPGVKIDRIVTDKLVTFMDEYDMDITNAMYLDKTEIQKKKSDMVYVARMRRLNHHSFKVSVDVTSEKAVDAVVRIFIGPKWDCMGRLLSYNDKRLDMVEIDSFLYKLETGKNTIVRSSMEMHGVIGDRMMTRRMMDNTVDTTGSMERIVDSFWYKSRLGFPHRLLLPLGHRDGLELQMFVIVTPVRTGLVLPSIDMGVMKDRRACRWSVCFDTMPLGFPFDREIDMSHFVTNNMKFHDILVFRKDLDLSNSVKDIDTSDMVMMRDDLTYLDRDMLVRWSYKDVMMMSTDKMMRL-