Monarch geneset OGS2.0

DPOGS205542
TranscriptDPOGS205542-TA672 bp
ProteinDPOGS205542-PA223 aa
Genomic positionDPSCF300056 + 552396-556253
RNAseq coverage20x (Rank: top 79%)
Annotation
HeliconiusHMEL0037924e-6864.44% 
BombyxBGIBMGA000091-TA1e-12192.41% 
DrosophilaSix4-PB2e-8879.68% 
EBI UniRef50UniRef50_E3WS852e-9178.46%Putative uncharacterized protein n=4 Tax=Endopterygota RepID=E3WS85_ANODA
NCBI RefSeqXP_309580.46e-9176.73%AGAP011065-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|3123818015e-9178.46%hypothetical protein AND_05828 [Anopheles darlingi]
NCBI nr blastxgi|3123818018e-9078.46%hypothetical protein AND_05828 [Anopheles darlingi]
Group
Gene OntologyGO:00036772.4e-19DNA binding
GO:00063552.4e-19regulation of transcription, DNA-dependent
GO:00055153.7e-19protein binding
GO:00435651.2e-16sequence-specific DNA binding
GO:00037001.2e-16sequence-specific DNA binding transcription factor activity
KEGG pathway 
InterPro domain[125-186] IPR0122872.4e-19Homeodomain-related
[120-189] IPR0090573.7e-19Homeodomain-like
[122-184] IPR0013561.2e-16Homeobox
Orthology groupMCL12887 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205542-TA
ATGCGAAGATGTTTAAACTTCAACTCCGAACAGGTTCAATGCGTGTGTGAGGCCCTCCAGCAAAAAGGCGATATAGAAAAATTGGCGGCATTCCTATGGAGTCTACCACCGAGTGAATTATTAAGAGGAAATGAAACCGTTCTCAGAGCCCGCGCTTTGGTGGCGTATCATCGCGGCGTATTTCAGGAGTTGTACGCCATATTGGAGACGCACACATTCTCACCTCGTCACCACACCGATCTCCAGAACCTTTGGTTTAAAGCGCACTATAAGGAAGCCCAGAAAGTCAGAGGAAGACCGCTTGGAGCTGTTGATAAATACCGTCTTCGCAAGAAGTATCCCTTGCCAAAGACGATCTGGGATGGTGAAGAGACGGTGTACTGCTTTAAGGAAAAGTCGAGAAATGCGCTGAAAGACTGTTACTATAGAAACCGTTACCCAACTCCAGACGAAAAACGTGCGCTCGCACAAAAAACAGGCTTGACATTAACACAAGTGTCAAATTGGTTCAAGAACCGACGCCAGAGGGATAGGACACCGCAGCAACCGAATAGACCTGAAATGATGGTTCCGGCTCAATATGTTGGTTCGCAGCCGGGTTTGGCGCAATCTTTTCTTCCCAATGCCTACTATAAGCTTCAAGAATCTCACTATTTACACGGAAATCCTTGA

Protein sequence:

>DPOGS205542-PA
MRRCLNFNSEQVQCVCEALQQKGDIEKLAAFLWSLPPSELLRGNETVLRARALVAYHRGVFQELYAILETHTFSPRHHTDLQNLWFKAHYKEAQKVRGRPLGAVDKYRLRKKYPLPKTIWDGEETVYCFKEKSRNALKDCYYRNRYPTPDEKRALAQKTGLTLTQVSNWFKNRRQRDRTPQQPNRPEMMVPAQYVGSQPGLAQSFLPNAYYKLQESHYLHGNP-