Monarch geneset OGS2.0

DPOGS205579
TranscriptDPOGS205579-TA849 bp
ProteinDPOGS205579-PA282 aa
Genomic positionDPSCF300237 - 317355-340655
RNAseq coverage287x (Rank: top 38%)
Annotation
HeliconiusHMEL0084992e-8694.51% 
BombyxBGIBMGA011098-TA4e-0742.62% 
DrosophilaDll-PC5e-5374.68% 
EBI UniRef50UniRef50_P200099e-5174.68%Homeotic protein distal-less n=58 Tax=Protostomia RepID=DLL_DROME
NCBI RefSeqXP_308706.34e-5373.12%AGAP007058-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|8373397e-9697.74%Distal-less Dll limb-patterning gene product [Junonia coenia=buckeye, Peptide, 197 aa]
NCBI nr blastxgi|154258787e-9998.86%homeotic transcription factor dll [Junonia coenia]
Group
Gene OntologyGO:00036779.7e-11DNA binding
GO:00063559.7e-11regulation of transcription, DNA-dependent
GO:00055155.1e-10protein binding
GO:00435652.4e-09sequence-specific DNA binding
GO:00037002.4e-09sequence-specific DNA binding transcription factor activity
KEGG pathway 
InterPro domain[181-231] IPR0122879.7e-11Homeodomain-related
[175-231] IPR0090575.1e-10Homeodomain-like
[189-231] IPR0013562.4e-09Homeobox
Orthology groupMCL15303 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205579-TA
ATGGCGGGTATGTCCCCAGGCTCTGTAAGCGGCGTAATACCACACCAGCGGCATAAATCCCTCAAAATTACACGAATTCAATCACCAAGCACCAAACCCGACACGCTGAGTTTCTCAGATCCCTTCGGGCCTCCTCAGTCCTCGGACGGGGGGGGCCCATCCACCCCTCAGCCCGCCATGACCACCCAGGAGCTAGATCACCATCACCACCTCGGTGGATCACAAACACCCCACGATATATCCAACTCCACTAACTCAACGCCCACTAACGTCTCATCGAAATCAGCCTTTATAGAGCTTCAACAACACGGCTATGGCCCCTTCAAGGGGGGCTATCAACACCCCCACCATTTTGGCAGCCCGGGTGGTCAACAGAATCCCCACGAAGCCTCAGGTTTCCCTAGCCCCAGATCCTTAGGTTATCCATTCCCCCCTATGCATCAGAACACCTACGGTTATCATTTAGGATCCTACGCCCCTCAATGTGCAAGCCCTCCGAAAGATGAAAAATGCGGCCTCTCAGATGATCCCGGTTTACGCGTGAATGGGAAGGGGAAGAAGATGAGAAAGCCGCGCACGATATACTCCAGCTTGCAGCTGCAGCAGCTCAATCGACGGTTCCAGAGGACCCAGTACCTAGCGCTGCCGGAAAGAGCAGAGTTAGCAGCTAGTTTAGGACTCACACAGACTCAGAAAAAACCGTTCTCCAAATTTATAAAACATGATCTGTTATTATTGTCATTGGATAACATTGAAAAAAGAATGACACTTTTGGCGCCAAACGTCGAACCAATAAAAAGTCAAAGCACAGATAACGCAGGAGCCGATCTGTCACACAACAATAGATAA

Protein sequence:

>DPOGS205579-PA
MAGMSPGSVSGVIPHQRHKSLKITRIQSPSTKPDTLSFSDPFGPPQSSDGGGPSTPQPAMTTQELDHHHHLGGSQTPHDISNSTNSTPTNVSSKSAFIELQQHGYGPFKGGYQHPHHFGSPGGQQNPHEASGFPSPRSLGYPFPPMHQNTYGYHLGSYAPQCASPPKDEKCGLSDDPGLRVNGKGKKMRKPRTIYSSLQLQQLNRRFQRTQYLALPERAELAASLGLTQTQKKPFSKFIKHDLLLLSLDNIEKRMTLLAPNVEPIKSQSTDNAGADLSHNNR-