Monarch geneset OGS2.0

DPOGS206362
TranscriptDPOGS206362-TA1194 bp
ProteinDPOGS206362-PA397 aa
Genomic positionDPSCF300082 + 1313107-1315288
RNAseq coverage812x (Rank: top 16%)
Annotation
HeliconiusHMEL0171250.083.12% 
BombyxBGIBMGA014224-TA6e-17974.62% 
Drosophilayellow-b-PA3e-11850.00% 
EBI UniRef50UniRef50_F1AAM60.082.87%Yellow-b n=8 Tax=Nymphalidae RepID=F1AAM6_9NEOP
NCBI RefSeqXP_319306.45e-12352.62%AGAP010145-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|3235059630.082.87%yellow-b [Heliconius melpomene]
NCBI nr blastxgi|3235059630.083.08%yellow-b [Heliconius melpomene]
Group
KEGG pathway 
InterPro domain[1-384] IPR0035342.9e-132Major royal jelly-related
[2-340] IPR0110421.2e-69Six-bladed beta-propeller, TolB-like
[11-32] IPR0179962.7e-07Major royal jelly
Orthology groupMCL16594 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206362-TA
ATGGGCCTGGAGGTGTTCGCAGATCGGCTGTTTATAACAGTCCCACGGTGGAGGAATGGAGTGCCCGCCAGTTTGACATATGTCAATCTGAAAGACAACTCGACGAAGTCACCTAAACTGATCCCGTATCCGAATTGGGAGGCTCACAACATCGTCGATGGTAAACCTGAGATCGTCTCTCCGTTCAGGGTGAGAGCTGACAGATGCGGTCGTCTTTGGGTTTTGGACAATGGGAAGATTGGCAGCCTCGAAGCTAATGTCACCAAGTTCACACCATCTATATCTATTTACGACCTGAAGACGGACAATCTCTTGAGGAAGTACGTGTTCCCCGAGGACCAAGTGAAGGAAGATTCAGGGTTCGCGAACATTGCCGTGGAGGATATAGACTGTGATAGAACTTATGCTTACGCCGGTGACGTCGGGAAGGCCGGGGTCGTGGTGTACTCGTGGGAGAAGAACGACTCCTGGAGGATCACCCATCACTTCTTCAACCCCGACCCGCTGGCCTGTGACTTCAGCGTGAAGGGCTACAACTTCAGCTGGACAGACGCCATCTTCGGCATCGGCATATCGGCGCCAAATGCTGACAACTTCAGCACGATATACTTCCACCCGATGGCCAGCAACAACGAGTTCGCCGTCTCCACTGAGTACTTGAGAAACAAGACGGTCGCCGAACAGAACTTCGACGCCTTCAAAGTCCTCGGCAGCAGAGGACCGAATGCGCAGTCCAGCGCCTCGTTCGTGGACCCGAAGACCGGAGTGTTGTTCTATTCGCTGGTAAATCTGAACGCCGTAGCGTGCTGGAGAACCACCAACAAGGAATACCTGATGAAGAATCAGGGAAGAATATACATGGATGATGTTAAAATGATCTATCCCACGGACATCAAGGTGGACTACGACGAGAACCTCTGGGTCCTGTCGAATCGCATGCCAATATGGATGTACGCTAAGCTGGACAGTAACGACACCAACTTCAGGGTGTTCTCCGCTCCGGTCCTCAAAGCGATCAGTCACACAGCCTGTGACGTCACCGCCAGATCAGACATCCTCGACAAGTTCGTTAATAAAGTCAAAAACGTCACCAAAAACATATCGTCCAAGTTGAATCCCAACTCCGCGCCGATCCTCTCGTCCTCTTCATTGACATCCCTCGTAGTTCTGACCGTCTTGAGTTTGTTGGTGTGA

Protein sequence:

>DPOGS206362-PA
MGLEVFADRLFITVPRWRNGVPASLTYVNLKDNSTKSPKLIPYPNWEAHNIVDGKPEIVSPFRVRADRCGRLWVLDNGKIGSLEANVTKFTPSISIYDLKTDNLLRKYVFPEDQVKEDSGFANIAVEDIDCDRTYAYAGDVGKAGVVVYSWEKNDSWRITHHFFNPDPLACDFSVKGYNFSWTDAIFGIGISAPNADNFSTIYFHPMASNNEFAVSTEYLRNKTVAEQNFDAFKVLGSRGPNAQSSASFVDPKTGVLFYSLVNLNAVACWRTTNKEYLMKNQGRIYMDDVKMIYPTDIKVDYDENLWVLSNRMPIWMYAKLDSNDTNFRVFSAPVLKAISHTACDVTARSDILDKFVNKVKNVTKNISSKLNPNSAPILSSSSLTSLVVLTVLSLLV-