Monarch geneset OGS2.0

DPOGS204680
TranscriptDPOGS204680-TA999 bp
ProteinDPOGS204680-PA332 aa
Genomic positionDPSCF300170 - 32885-35332
RNAseq coverage90x (Rank: top 63%)
Annotation
HeliconiusHMEL0176027e-13772.06% 
BombyxBGIBMGA010249-TA5e-12264.26% 
Drosophilasmp-30-PA8e-4135.49% 
EBI UniRef50UniRef50_UPI00022CA6861e-5038.72%UPI00022CA686 related cluster n=3 Tax=unknown RepID=UPI00022CA686
NCBI RefSeqXP_001599934.11e-5238.19%PREDICTED: similar to anterior fat body protein [Nasonia vitripennis]
NCBI nr blastpgi|1565425322e-5138.19%PREDICTED: regucalcin-like isoform 1 [Nasonia vitripennis]
NCBI nr blastxgi|1565425323e-5238.78%PREDICTED: regucalcin-like isoform 1 [Nasonia vitripennis]
Group
KEGG pathwaypto:PTO09072e-27 
 K01053 (E3.1.1.17)maps-> Pentose phosphate pathway
    Ascorbate and aldarate metabolism
    Caprolactam degradation
InterPro domain[41-295] IPR0136582.9e-60SMP-30/Gluconolaconase/LRE-like region
[38-318] IPR0110421.2e-59Six-bladed beta-propeller, TolB-like
[41-58] IPR0055115.5e-40Senescence marker protein-30 (SMP-30)
Orthology groupMCL34561 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204680-TA
ATGCGGATCATTCTGTTACTGGGGCTCGCTGCTTGCCTACACGCTAAGAATCCGATAAAAAACTATAAAATTTCTATGGTATCATCTGAGGAGAGTAACAAGCCTGCGTTGTTTACACACGGAGAGAGTCCAGTGTGGGATGTAGACACGCAATCCTTATTTTTCGTAGACGTTCATCAGCAAAATGTTCACCGACTTGACTACGCCACAGGAAAAATATACACCAAGCATATCGGTTACGGTCAAGTGAACGTGGTATCGCTAGTTTCGGGGTCTCGGCGACTATTAGTATGCGTACGAGCCGGTCTGTATCTGTTGGACTGGGATGTGGCAGGAGACTCGGCATTACGTCTCATAACCACCGTGGACGATGGGCTACCGGATAACTACTTAAATGAAGGCAAACCAGATGTAGAAGGGCGGTTTTGGGCTGGGACCAAAGGCCCACAATCTGGTGACGAAGTGACACCAGATAAGGGCACCTTCTACAGCTTCGATTTAAACAATTTTAAACCTCAAGTGCAACTGCGTCCTGTGTCTATATCTAACGGCTTGGTGTGGTCCTTAAATAATACTGTTTTGTATTACATTGACTCAAGCACACAGAAGGTCGAGGCATTTGATTTTGACTCTGTGAGTGGGGCGATAAGTGGAAGACGAACAATTGTAGATATAACGAATTATGGTTACGAAGACGCAATACCGGACGGAATGACCATAGACAAAAGAGGAAACCTGTGGGTTGCGATCATGTTCGGTGGAACAGTTCTACACGTAAACCCGGACAAAAGAGAGGTTATATTTGGTTACAAGCTGCCAGTGTCACGGACGACGTCTCTGACCTGGGGCGGGCCGAATTTGGACGAATTATTCGTGACAACATCTAAAGAAACAGACTCCGAGGATAGACTGAGCGGCGCCATATTCACAATACGCGAGACGGGCAGCGCGGGACTCCCGCCTAATAAACTCAAAATGGAAAATGCGGACGATTATTGA

Protein sequence:

>DPOGS204680-PA
MRIILLLGLAACLHAKNPIKNYKISMVSSEESNKPALFTHGESPVWDVDTQSLFFVDVHQQNVHRLDYATGKIYTKHIGYGQVNVVSLVSGSRRLLVCVRAGLYLLDWDVAGDSALRLITTVDDGLPDNYLNEGKPDVEGRFWAGTKGPQSGDEVTPDKGTFYSFDLNNFKPQVQLRPVSISNGLVWSLNNTVLYYIDSSTQKVEAFDFDSVSGAISGRRTIVDITNYGYEDAIPDGMTIDKRGNLWVAIMFGGTVLHVNPDKREVIFGYKLPVSRTTSLTWGGPNLDELFVTTSKETDSEDRLSGAIFTIRETGSAGLPPNKLKMENADDY-