Monarch geneset OGS2.0

DPOGS214259
TranscriptDPOGS214259-TA2310 bp
ProteinDPOGS214259-PA769 aa
Genomic positionDPSCF300014 + 1520873-1525706
RNAseq coverage415x (Rank: top 29%)
Annotation
HeliconiusHMEL0113781e-16274.73% 
BombyxBGIBMGA005976-TA0.069.02% 
DrosophilaCG12547-PA2e-13838.56% 
EBI UniRef50UniRef50_B0XF455e-15142.60%NHL repeat containing 2 n=6 Tax=Endopterygota RepID=B0XF45_CULQU
NCBI RefSeqXP_001657496.12e-15640.98%hypothetical protein AaeL_AAEL000965 [Aedes aegypti]
NCBI nr blastpgi|1571123244e-15540.98%hypothetical protein AaeL_AAEL000965 [Aedes aegypti]
NCBI nr blastxgi|1571123248e-15241.12%hypothetical protein AaeL_AAEL000965 [Aedes aegypti]
Group
KEGG pathwaysus:Acid_32959e-08 
 K13735 (yeeJ)maps-> Bacterial invasion of epithelial cells
InterPro domain[394-600] IPR0110428.3e-39Six-bladed beta-propeller, TolB-like
[45-189] IPR0123364.9e-27Thioredoxin-like fold
Orthology groupMCL12895 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214259-TA
ATGGATTCCAGTCCACTGGATTACGTGGCTCAGGCCTGTCTTGATTTAACGGAGGCTTTGGCGTCCACTTCAAATGCAACTGATCGCGAGACTCTCATAACAAATCATATCAAAAAAGTTTGGGCGATTGTTCCACCCATAGAGGATTTTAAAAAAAATTTGGAATGGGTAAATGTATCAGAACCTCTATCTCTGAGTCAACATTGTTCAGAGAAGGTAGTTGTTCTAGATTTTTGGACTTATTGCTGTATCAACTGCTATCATGTTCTACCGGATCTGGATTATATAGAAAACCTATATAAAAATGATAGTGGACTGGTTGTGATAGGTGTTCACTGTGCCAAGTTTACAAATGAAAAGTCTTCCAGTAATGTGTTAGCCGCAGTTCAAAGATATAATATCCGTCATCCAGTGGTTAATGATGCAGAGAGTGTTATGTGGGAGACCCTGGGCATTAGATGCTGGCCTACACTCCTTATATTAGGTCCTGGCAACAAACCAATATTCATTTTAACAGGAGAGGGCCACAGAGCGGAACTATGTTGTTATCTGGGATCGGCGTTACAGCATTTTGCTACAAGATTGTCAAATAGTTCATTGCCAGTATCACTTAATTCTTCTGTGAAAGCTAAGGACAATGATAAACTTTACTTTCCTAGTAAGATAGCTTTAAATCCATTCTATCGTGGACGTGGTGAGGAGCCTTTCTTAGCAATATCTGACACGGGTCATCATCGTGTTCTGCTGACCGACTGTTCGGGCATCATATTGAGGGTAGTTGGAGGGAAGACACCTGGATTTAAAGATGGAAAGTTGACAGATGCACAGTTCAATTCACCTCAGGGTCTCTGTTGGTTGTCAAGTTCAGTGTTAGTTGTTTGTGATACAAATAACCATGCATTGAGAGCTGTACATCTCGATGAAGGAACCGTTGAAGTGTTGGCTGGGACAGGCGAGCAGGCTGTGGTCGGAGATTTTGGTGGGAAATGTCTAGGCCTTCAAGCGTTGTCTTCGCCGTGGGATGTAATATTATACACGACCCCTGACATGGACATGTCTGTTCGTCCCAGCCTGCCCCCTCCCCCTCCCCCTCCGCCAGGAGTCACGGTTGTTGAGAAGGAGATAGAAACTAAAGATGATACCAAAGGACCTCTATTCTGTGGTCGGTTTGTAGACAAGAAAGACGAGAAACGCCGCGTGCTTTTGATAGCTTGTGCTGGATCACACCAGATCTGGGCACTGTTCCTCGATAACACTATTTGGTGGAAATACAAGTCGTATTCAGAAGGTACGTGCGTGTGTGTGGCGGGGTCGGGGGCGGAGGCCGCCCGCAACAGTGCGTACGCAGCCAGCGCCGCCTTCGCACAGCCATCAGCTCTCGCGCTACGATCCGGATCGAGCCCAGAGGTATTCATAGCTGACTCCGAGAGCTCGTCCATAAGAAGACTCGCGCTGTCCACTGGACAAGTTAGTACGTTATGTGGAGGCGACAGGAACCCATTGAATCTCTTCGCCTTTGGGGACGTCGATGATGTCGGGGTTGAAGCAAAATTGCAACATCCAATGGCAGTGGCTTACAATGAGGCCAATAAAACATTATATGTAGCAGACACGTACAATCACAAAATAAAGAAAGTCGATGTCGGACCACAGAAAGTGTCGACGATCAATCCAACGATGATCGAAAGCACTGATCCGGCTAAGTTCAACGAACCCTCAGGCCTGTCCATCAGCTCGGACGGGAAATACCTCTACATAGCCGACACCAACAATCACAGCATAAAAATACTGAACGTAGCCAAAAACGTGTGTCAGGAGTTCAAAGTGCGTCTACCCGATCCGAAATTCACGGAACCCGAGAACCTGATCCTATATAAAAACGATCTGTTCGTGAACAGAAAGTGCGGCCACCTCATCATATACTTCAACGTTAATTTGGACGCTGAGACCAAGAACGTTAAATTCACACCCGGCGCCCCACAGAACTGGCACGTGTGTGTCCGGGACGATAATAACAAGGACGTTACGTTGGACGACTTTGAATTCGTTGGCTGTAGCCACAAAGGGAACAAACTGCCGGGGAAAGTGGAAATGAAGCTTAAAACTAGAACCGATAAGACACATTATCGCATGTACTTGAGCTTCCAAACCGCTCTCTGTGATTCCTCTGTGTGTTTCGCTCATCCCTTCACGATACGATCTACGATCTTAGTCAGGGATTCCGTCAAAATGGTCGAGTCTTATAAGATAACTTGCAAAGTGAACCCCGTCAACAGGGTCGAACAGAGGCCTGACCTGGCTAAAGCATGA

Protein sequence:

>DPOGS214259-PA
MDSSPLDYVAQACLDLTEALASTSNATDRETLITNHIKKVWAIVPPIEDFKKNLEWVNVSEPLSLSQHCSEKVVVLDFWTYCCINCYHVLPDLDYIENLYKNDSGLVVIGVHCAKFTNEKSSSNVLAAVQRYNIRHPVVNDAESVMWETLGIRCWPTLLILGPGNKPIFILTGEGHRAELCCYLGSALQHFATRLSNSSLPVSLNSSVKAKDNDKLYFPSKIALNPFYRGRGEEPFLAISDTGHHRVLLTDCSGIILRVVGGKTPGFKDGKLTDAQFNSPQGLCWLSSSVLVVCDTNNHALRAVHLDEGTVEVLAGTGEQAVVGDFGGKCLGLQALSSPWDVILYTTPDMDMSVRPSLPPPPPPPPGVTVVEKEIETKDDTKGPLFCGRFVDKKDEKRRVLLIACAGSHQIWALFLDNTIWWKYKSYSEGTCVCVAGSGAEAARNSAYAASAAFAQPSALALRSGSSPEVFIADSESSSIRRLALSTGQVSTLCGGDRNPLNLFAFGDVDDVGVEAKLQHPMAVAYNEANKTLYVADTYNHKIKKVDVGPQKVSTINPTMIESTDPAKFNEPSGLSISSDGKYLYIADTNNHSIKILNVAKNVCQEFKVRLPDPKFTEPENLILYKNDLFVNRKCGHLIIYFNVNLDAETKNVKFTPGAPQNWHVCVRDDNNKDVTLDDFEFVGCSHKGNKLPGKVEMKLKTRTDKTHYRMYLSFQTALCDSSVCFAHPFTIRSTILVRDSVKMVESYKITCKVNPVNRVEQRPDLAKA-