Monarch geneset OGS2.0

DPOGS216056
Transcript: DPOGS216056-TA, 1275 bp
Protein: DPOGS216056-PA, 424 aa
Genomic position: DPSCF300067 + 148550-150441
RNAseq coverage: 1137x (Rank: top 11%)
Annotation
Heliconius      HMEL015052        2e-146  64.55%
Bombyx          BGIBMGA008862-TA  4e-78   41.49%
Drosophila      CG8193-PA         8e-14   26.43%
EBI UniRef50    UniRef50_Q16I89   9e-29   32.14%  Hexamerin 2 beta n=11 Tax=Culicidae RepID=Q16I89_AEDAE
NCBI RefSeq     XP_001849286.1    4e-32   34.93%  arylphorin subunit alpha [Culex quinquefasciatus]
NCBI nr blastp  gi|170043201      7e-31   34.93%  arylphorin subunit alpha [Culex quinquefasciatus]
NCBI nr blastx  gi|282720989      1e-36   28.49%  hexamerin 4 precursor [Tribolium castaneum]
Group
Gene Ontology:
    GO:0006810  6.6e-35  transport
    GO:0005344  6.6e-35  oxygen transporter activity
KEGG pathway: dme:Dmel_CG42639  4e-08
    K00505 (E1.14.18.1) maps to:
        Riboflavin metabolism
        Betalain biosynthesis
        Isoquinoline alkaloid biosynthesis
        Tyrosine metabolism
        Melanogenesis
InterPro domain:
    [151-363]  IPR008922  1.7e-36  Uncharacterised domain, di-copper centre
    [153-362]  IPR000896  6.6e-35  Hemocyanin, copper-type
    [86-268]   IPR013788  5.2e-26  Arthropod hemocyanin/insect LSP
Orthology group: MCL25542 (Lepidoptera specific)
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS216056-TA
ATGTTTTTGCTACTATTACTCCTGGGAACAGTTAGATCAGCTCCTTTAGATGAATTTCAGACATTCGTAAACAATGAGGAAGTAGCATTTAATGATGAGGCTTATATCAAATATATTGTTCCATCCGGTTATTCTCCACACAAATCACAAAAACAATCTAAAGTAGAATTGTTGGATTTTGCAAATAAGGACAATGAACATTATGAGATCTTAAAAAAACATATTCTAGCTGGTAGTATTAAGAATGGTTTGACATTTAATATATACGACGACAACATGAGAGAAGCAAGCATTGCTTTGTTTCGATTATTACAATATTCTGAAAAAGAGCAAATAAGCAAAATAAAGGAATGGGCATTGGAGAATATAAATCATGATATCATAGATTATGCCTGGAGATTAGTCTCGCTTTACAGAACTGATGTTATGAAAGAACAGGAACCACCTTATGTATCCAAACCGAACTATTTCATAAACAGCGAAGCTATTTACAAAGCTTTAAAATTAAAAATTAGCAACGGAAAATTCGATTCTCAAACAGCAAGTGTTCAACAGTTCTATAGAAGTGACGATGTTATAACGATTAATGCTAACTATTCCGGATGGAACTTATTAAATGAAGACTGTAACGATAAACTTGATTACTTTAGAGAAGACATAGGTCTTAACAGCTACTACTACGGTGTCCATCTTCAATATCCATTTTGGATGAATAACGATGAATTAACTGGCATTGATCCAAAATATGCAGAACAATACTATTACATACATAAGCAATTGATGGCTAGGTATAGTTTAGAAAAGGAACACCCTGATTATAATAATTCTCAATTTGAATCTAAATGCTACGAGGATTTCATACCTTACTTAGTACATGACAACGGCTTGAACTTTGCAGTAAGATCAACTATAAAGAAGGAGAATAGCGAGGAATATGCACGTTTAAAGTCTGTAGATATAGCAATAAGAGAATGCATTGCAAGAGGATTTATTTACATGGAAAATAGCACACGCGTTACATTAACTGATGAAAACTTCGTTGATCTACTATCAAAATTAATTAGGGTAAATTTGGAAAGCGTGTCTATGGCAAAAATAATAAGATCTCTTTATGGTTACGGAGGCAAAGGATATTTTAAAAATGCGTTAGTATGTTCCGGCACCTTCAGTAATGCATCATCCACAAACTACGCTGCGCGATCCCATGTATTGGTATATAATTCAAACTATGCTAGATTACTTTACGGATTACTCAAACTCATTGGAACCATATAA

Protein sequence:

>DPOGS216056-PA
MFLLLLLLGTVRSAPLDEFQTFVNNEEVAFNDEAYIKYIVPSGYSPHKSQKQSKVELLDFANKDNEHYEILKKHILAGSIKNGLTFNIYDDNMREASIALFRLLQYSEKEQISKIKEWALENINHDIIDYAWRLVSLYRTDVMKEQEPPYVSKPNYFINSEAIYKALKLKISNGKFDSQTASVQQFYRSDDVITINANYSGWNLLNEDCNDKLDYFREDIGLNSYYYGVHLQYPFWMNNDELTGIDPKYAEQYYYIHKQLMARYSLEKEHPDYNNSQFESKCYEDFIPYLVHDNGLNFAVRSTIKKENSEEYARLKSVDIAIRECIARGFIYMENSTRVTLTDENFVDLLSKLIRVNLESVSMAKIIRSLYGYGGKGYFKNALVCSGTFSNASSTNYAARSHVLVYNSNYARLLYGLLKLIGTI-
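
Consistency check (illustrative sketch, not part of the database record): the transcript and protein lengths agree if the 1275 bp coding sequence encodes 424 residues plus one stop codon, since (424 + 1) x 3 = 1275. The Python sketch below translates a coding sequence with the standard genetic code so the two FASTA records can be compared. The function name `translate` and the choice to render the stop codon as "-" are assumptions made here to match the trailing "-" in the protein record above.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order
# (TTT, TTC, TTA, TTG, TCT, ...). "*" marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as `stop_symbol` (assumed here to be "-",
    matching the protein record on this page).
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First ten and last four codons of DPOGS216056-TA, copied from the record.
print(translate("ATGTTTTTGCTACTATTACTCCTGGGAACA"))  # MFLLLLLLGT
print(translate("GGAACCATATAA"))                    # GTI-
```

Passing the full DPOGS216056-TA sequence to `translate` and comparing the result with the DPOGS216056-PA record would extend the same check to the whole entry.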