Monarch geneset OGS2.0

DPOGS203195
TranscriptDPOGS203195-TA519 bp
ProteinDPOGS203195-PA172 aa
Genomic positionDPSCF300035 + 77107-85896
RNAseq coverage255x (Rank: top 41%)
Annotation
HeliconiusHMEL0137261e-1666.15% 
BombyxBGIBMGA011413-TA7e-2474.67% 
Drosophilanau-PA8e-2452.25% 
EBI UniRef50UniRef50_E5SR133e-2553.33%Cuticle collagen 1 n=1 Tax=Trichinella spiralis RepID=E5SR13_TRISP
NCBI RefSeqNP_001158428.19e-2546.72%myogenic differentiation protein [Saccoglossus kowalevskii]
NCBI nr blastpgi|3392521468e-2553.33%cuticle collagen 1 [Trichinella spiralis]
NCBI nr blastxgi|3392521465e-2453.33%cuticle collagen 1 [Trichinella spiralis]
Group
Gene OntologyGO:00056346.2e-17nucleus
GO:00063556.2e-17regulation of transcription, DNA-dependent
GO:00036777.5e-10DNA binding
GO:00075177.5e-10muscle organ development
KEGG pathway 
InterPro domain[49-113] IPR0115986.2e-17Helix-loop-helix DNA-binding
[23-58] IPR0025467.5e-10Myogenic basic muscle-specific protein
[59-109] IPR0010921e-09Helix-loop-helix DNA-binding domain
Orthology groupMCL15188 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203195-TA
ATGAGCTACAGCGCCATCTATGACTACAGTTTGGATTTACCGAAAAACAATGGGTGTTGTGATATAAAGGAGGAGGAAAAGAACGAGGAGCACATACAGCATGTGCTGGGACCCAGCAGACGATGTCTGGCCTGGGCCTGCAAGGCCTGCAAGAGAAAAACAGCAGCTGTTGACAGGCGGAAAGCGGCCACGCTACGTGAGAGAAGGAGATTAAGAAAAGTGAACGCTGCTTTCGAAGAGTTACGAATACGTGCTCGGGCTGGGAGTGGACGTCTGCCAAAGCTGGAGATACTCCGAGCAGCCATCCAGCACATTGAGAGGCTCCAAGCCGCCCTTAGAGCAGCAACCGCTCTGGACTCTGACAGCTGCAAGAACTACGCAGAGCCGGCGAGGAAGCCTGACAGCAAGCAAAGTAAAAATGAAAAGGAGAAGCCCTTCCTCAGCGGTGACCCAACGTGCGGGCTGAACAGTCTAGCGAGGTTGCGCTGCATCGTACAGGCTCTGGCAAATGAAACGTGA

Protein sequence:

>DPOGS203195-PA
MSYSAIYDYSLDLPKNNGCCDIKEEEKNEEHIQHVLGPSRRCLAWACKACKRKTAAVDRRKAATLRERRRLRKVNAAFEELRIRARAGSGRLPKLEILRAAIQHIERLQAALRAATALDSDSCKNYAEPARKPDSKQSKNEKEKPFLSGDPTCGLNSLARLRCIVQALANET-