Monarch geneset OGS2.0

DPOGS207345
Transcript: DPOGS207345-TA, 1881 bp
Protein: DPOGS207345-PA, 626 aa
Genomic position: DPSCF300188 + 214468-218214
RNAseq coverage: 1091x (Rank: top 12%)
Annotation
Heliconius: HMEL002206  3e-92  88.34%
Bombyx: BGIBMGA010273-TA  0.0  90.48%
Drosophila: osa-PB  1e-123  59.07%
EBI UniRef50: UniRef50_B0WTH6  2e-128  65.08%  Putative uncharacterized protein n=1 Tax=Culex quinquefasciatus RepID=B0WTH6_CULQU
NCBI RefSeq: XP_001814255.1  8e-130  62.43%  PREDICTED: similar to osa CG7467-PA [Tribolium castaneum]
NCBI nr blastp: gi|189233762  2e-128  62.43%  PREDICTED: similar to osa CG7467-PA [Tribolium castaneum]
NCBI nr blastx: gi|170049138  4e-129  64.11%  conserved hypothetical protein [Culex quinquefasciatus]
Group
KEGG pathway 
InterPro domain: [304-567]  IPR021906  7.1e-95  Protein of unknown function DUF3518
Orthology group: MCL12043  Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207345-TA
ATGAGCTTAGCTGCGAGGAGCAGCGACTACGTAGCTTTGTACATAGCGGAACAGAGAACGGATCTCAATCGTGTGACGTTGAGCATAAAACGAGACTTGTACAACCGACAGGGTGGCGGAGCCAGCGCAGGTGGACCGGCCGGAGCCCCCCGGCGGCACCCGGACTTCGCCAAGCAAGAGGGGTACGCGGGTCCCGGAGGCCCGGGCGGCCCAGCGGGAGCGGCGCGGTTCGCAGGTGGCGGCTGGGCGGGCGGCTTCCCCCGTGGCGGCCCACCCGCGCCTGCCTGGCGCCCTCCCGCCCCCCTACCACACGCGCCTCACCACCCACACCAGCCGGCCTGGCCTCACCACCAGCCGCACCAGCCACATCAACCTTACCACCCGCCGACGCCGGGCGGCGTGGCGTGGGGCGCCCCGCGACCGCCGCAGGAACTACCGCCCGCCGCGTCTTCACCCGGTGCAGCGGGCGTAGGTCAATTAAAGAGGGAATTAACCTTCCCCGCCGAGTGTATAGAGGCGACCGTCCCCGCCGCGGAGAAGCGTCGCCGACTGACCAAGGCCGACGTGGCACCCGTCGACGCCTGGAGGATCATGATGGCGCTCAAGTCCGGTCTGCTGGCCGAGACCTGCTGGGCCCTCGACATACTCAACATACTACTCTTCGACGACAATTGCATAGGATACTTCGGCCTCCAGCACATGCCCGGCCTACTCGACCTCCTGCTCGAGCACTTCCAGAAAAGTCTCGGAGACGTGTTCGACGCTCCCGCTACCGAAAGCGAACCCTGGCGACCGGCGCTGCAAGTAAGGGACCCCGCCGGCGTGCTTAAACGACGCCGCCTAGAGGACTACGAGGACGAGTGTTACACGCGCGACGAACCAAGCCTAAACTTAGTGAACGAATCCCGGGACGCCCTCGCGAGACGATGTATCGCGTTATCCAATATATTACGTGGACTCACGTTCGTGCCAGGAAACGAGGCGGAGTTCTCCAGGTCCGGGGCGTTCCTCGCCCTCGCCGGGAAACTGTTGCTGCTTCACCACGAGCACGCGCCCAGAGCCGCGAGAGCGAGGGCTTACGAGCGAGCGGCGAGAGACGAAGTCGACGTGGACTCTTGCTGTTCGAGTCTTCGGGGAGAAGGGGAGTGGTGGTGGGACACGCTGGCCCAGCTGCGGGAGGACGCGCTCGTCTGCTGCGCGAACATCGCGGGCAGCGTGGAGCTCGGCGGCCAGCCGGAGGCCGTGGCGCGGCCGCTGCTGGACGGCCTGCTGCACTGGAGTGTGTGTCCGGCTGCCGTGGCGGGAGACCCCCCGCCCGCCGCCGCGCCCGGCTCTCCGCTGTCTCCGCGCCGCCTCGCTCTGGAGGCGCTGTGCAAGCTGTGCGTGACGGACGCCAACGTGGACTTGGTGCTGGCGACGCCGCCGCGCGGTCGCATGGCGGCTCTGTGTGCGGGACTGGCGCGAGACCTGTGTCGGCCGGAACGGCCCGTGGTGCGAGAGTTCGCCGTCAACCTGCTGCACTACCTGGCAGGAGCCGGAGGCGCGGCGGCGCGGGAGGTTGCCATGCACGCGCCGGCCGTGGCGCAGCTGGTGGCGTTCATCGAGCGCGCGGAGCAAACCGCGCTGGGCGTCGCCAACCAGCACGGGGTGGCGGCCCTGCGAGACAACCCGGATGCGATGGGCACCTCACTAGACATGCTGCGGCGCGCCGCGGCCACGCTGCTGCGGCTGGCGGAGCACCCCGAGAACAGGCCGCTGATCCGCCGCCACGAGCGCCGCCTGCTGTCGCTTGTCATGAGCCAGATCCTCGACCAGAAGGTGGCGCACGAGCTGGCCGACGTGTTGTTCCACTGCAGCCAGGCGGCCGGCCAGGCGGACTGA

Protein sequence:

>DPOGS207345-PA
MSLAARSSDYVALYIAEQRTDLNRVTLSIKRDLYNRQGGGASAGGPAGAPRRHPDFAKQEGYAGPGGPGGPAGAARFAGGGWAGGFPRGGPPAPAWRPPAPLPHAPHHPHQPAWPHHQPHQPHQPYHPPTPGGVAWGAPRPPQELPPAASSPGAAGVGQLKRELTFPAECIEATVPAAEKRRRLTKADVAPVDAWRIMMALKSGLLAETCWALDILNILLFDDNCIGYFGLQHMPGLLDLLLEHFQKSLGDVFDAPATESEPWRPALQVRDPAGVLKRRRLEDYEDECYTRDEPSLNLVNESRDALARRCIALSNILRGLTFVPGNEAEFSRSGAFLALAGKLLLLHHEHAPRAARARAYERAARDEVDVDSCCSSLRGEGEWWWDTLAQLREDALVCCANIAGSVELGGQPEAVARPLLDGLLHWSVCPAAVAGDPPPAAAPGSPLSPRRLALEALCKLCVTDANVDLVLATPPRGRMAALCAGLARDLCRPERPVVREFAVNLLHYLAGAGGAAAREVAMHAPAVAQLVAFIERAEQTALGVANQHGVAALRDNPDAMGTSLDMLRRAAATLLRLAEHPENRPLIRRHERRLLSLVMSQILDQKVAHELADVLFHCSQAAGQAD-
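
Length consistency check:

The lengths reported in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the original record. It assumes that the genomic coordinates are 1-based and inclusive, and that the trailing "-" in the protein sequence marks a single stop codon; neither convention is stated in the record. The helper names `span_length` and `cds_matches_protein` are hypothetical.

```python
def span_length(start: int, end: int) -> int:
    """Length of a genomic interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def cds_matches_protein(transcript_bp: int, protein_aa: int) -> bool:
    """True if the transcript length equals the codons for the protein plus one stop codon."""
    return transcript_bp == 3 * (protein_aa + 1)


# Values copied from the record for DPOGS207345.
record = {"transcript_bp": 1881, "protein_aa": 626, "start": 214468, "end": 218214}

genomic_bp = span_length(record["start"], record["end"])
print(genomic_bp)                                                             # 3747
print(cds_matches_protein(record["transcript_bp"], record["protein_aa"]))    # True
print(genomic_bp - record["transcript_bp"])                                   # 1866
```

Under these assumptions the transcript length matches 626 codons plus a stop codon, and the genomic span exceeds the transcript by 1866 bp. That difference would be consistent with intronic sequence, although the record itself does not describe the gene structure.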