Monarch geneset OGS2.0

DPOGS202194
TranscriptDPOGS202194-TA1092 bp
ProteinDPOGS202194-PA363 aa
Genomic positionDPSCF300149 - 359495-364009
RNAseq coverage327x (Rank: top 35%)
Annotation
HeliconiusHMEL0091942e-12865.85% 
BombyxBGIBMGA013515-TA1e-10558.22% 
DrosophilaCG16812-PA7e-3157.38% 
EBI UniRef50UniRef50_E0VIQ22e-3134.29%Putative uncharacterized protein n=1 Tax=Pediculus humanus corporis RepID=E0VIQ2_PEDHC
NCBI RefSeqXP_002425996.13e-3234.29%conserved hypothetical protein [Pediculus humanus corporis]
NCBI nr blastpgi|2420104816e-3134.29%conserved hypothetical protein [Pediculus humanus corporis]
NCBI nr blastxgi|2420104811e-2930.23%conserved hypothetical protein [Pediculus humanus corporis]
Group
Gene OntologyGO:00055152.4e-09protein binding
KEGG pathway 
InterPro domain[10-76] IPR0137612.8e-11Sterile alpha motif-type
[14-77] IPR0109932.4e-09Sterile alpha motif homology
Orthology groupMCL22001 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202194-TA
ATGAGTGCAGGATCTAATAAAATGGATTCCAATATCACTAGTTCGTGGGTCAATTTCTTCACCGCCGCTGGGATACCTTCGGAGGTGGGAGCCACCTACGCTATCACCTTCACCGAGAATCGGATTCAAAATGATATGTTGCTTGACTTGAACAAGGAATACCTAAGAGATATGGGCATCACAAGGATGGGAGATATAATAGCCATATTAAGACATGCTAAAATGGTGCACGAGAGTACGGCCAGGGAAAGGGTGTTGAGCACAACGGTGGCTAGCAACAAGGTGCCCGTGGCTGCCGTCACTGGCAGAGCTACCGTTACCCAACCATCATCTCCTGCGAGTCGCATGTTAGAACACTACACCAGGAACCCTCAAGTACAAGAGACCCCTCCGCTGAGAGCCGCGCCGCAGAAACGGAAGTCCAGTGAACTGAACCCAGAAGATAAGAATGTTAAGAAGTCTCGTCTCATCAGATTCGGTACTCCTCCACAGACAGCCGTCACCACTAAGGAAGCAGCTCAAAATAAGACTGTTTTCGCACGTCTTGGTCATTCAGATCCAGTACCGGATGTTCCTCAAAAGGTCGGAAAGAAAGTGTTTTCCAGACTCGGTGCCAAGGACGACAAGGAAAAAGACCAGGTCGTGCCCATCGAAAAGGACGCTCTCAAATACGAGGGTATCTTGAAAACCAGTCCGGAGCCCAGGAAAGTGTTCACCGTCACCACATCGGTAAATAATATAAGGAAAATAGCGTTAGGCACCATGAGGGCTGATGAGACGCCGGTCAGTGTGAAGGACAAACTGGCCATTGCCCGAGCGAAATCCGTGAAATTCTCAAATCGCGTGGAGTATAAAGAAATAGAAGCAGTCAACAAGGTCCAGCTCAAGCCCAGGCTCACAACCGTCTTTAATAAACCGGAGAGGAGGTTGGCCATGCCGGAGAACACGGGCGTCAAGGCCAGGCTGGGAAACAAACGCGCCGGCGCCACCAAGATGGCCGCCGTCAACAAGATGGCGGTCGGCACCAAAAAACAAAACGCGCAGAGTAAATTCACCATAACTAGGAACGTCTTCAACAGACTGGGCGTTTGA

Protein sequence:

>DPOGS202194-PA
MSAGSNKMDSNITSSWVNFFTAAGIPSEVGATYAITFTENRIQNDMLLDLNKEYLRDMGITRMGDIIAILRHAKMVHESTARERVLSTTVASNKVPVAAVTGRATVTQPSSPASRMLEHYTRNPQVQETPPLRAAPQKRKSSELNPEDKNVKKSRLIRFGTPPQTAVTTKEAAQNKTVFARLGHSDPVPDVPQKVGKKVFSRLGAKDDKEKDQVVPIEKDALKYEGILKTSPEPRKVFTVTTSVNNIRKIALGTMRADETPVSVKDKLAIARAKSVKFSNRVEYKEIEAVNKVQLKPRLTTVFNKPERRLAMPENTGVKARLGNKRAGATKMAAVNKMAVGTKKQNAQSKFTITRNVFNRLGV-