Monarch geneset OGS2.0

DPOGS200620
TranscriptDPOGS200620-TA1224 bp
ProteinDPOGS200620-PA407 aa
Genomic positionDPSCF300076 + 122395-128902
RNAseq coverage13x (Rank: top 82%)
Annotation
HeliconiusHMEL0224312e-14581.07% 
BombyxBGIBMGA008906-TA0.083.94% 
Drosophilarun-PA9e-8174.42% 
EBI UniRef50UniRef50_A9QLJ30.083.94%Runt n=2 Tax=Obtectomera RepID=A9QLJ3_BOMMO
NCBI RefSeqNP_001104821.10.083.94%runt [Bombyx mori]
NCBI nr blastpgi|1624627670.083.94%runt [Bombyx mori]
NCBI nr blastxgi|1624627670.083.94%runt [Bombyx mori]
Group
Gene OntologyGO:00056343.8e-134nucleus
GO:00036773.8e-134DNA binding
GO:00055243.8e-134ATP binding
GO:00063553.8e-134regulation of transcription, DNA-dependent
GO:00037006.2e-72sequence-specific DNA binding transcription factor activity
KEGG pathway 
InterPro domain[6-407] IPR0000403.8e-134Acute myeloid leukemia 1 protein (AML 1)/Runt
[19-150] IPR0123466.2e-72p53/RUNT-type transcription factor, DNA-binding domain
[19-150] IPR0135241.1e-70Acute myeloid leukemia 1 (AML 1)/Runt
[21-147] IPR0089675.9e-60p53-like transcription factor, DNA-binding
Orthology groupMCL16986 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200620-TA
ATGCACCTTCCGCACGCAAGCCCTCCGACACCAACGATGGCGGATGTTTACTCCCACATCCACGAGTACTACCGCCAGACCCACGGTGACTTAGTGCAGACTGGTTCACCAGCTGTATTATGCTCAGCTCTGCCCGGCCACTGGAGGTCGAACAAGTCTCTGCCTGTGGCTTTCAAGGTTGTTGCCTTAGACGACGTCCAGGATGGCACTATAGTTACGATAAAGGCTGGAAATGATGAAAACGTTATGGCTGAAATGAGAAACTGCACGGCGGTTATGAAAAACCAAGTGGCAAAATTCAATGATTTGAGGTTTGTGGGACGAAGTGGACGAGGCAAGTCCTTCAGTCTCACCATCACCATCAGTACTTTTCCCTCACAAGTGGCAACTTACTCAAAGGCCATCAAAGTCACCGTTGACGGGCCGAGAGAACCCAGAACCAAACAGAATTATGGATATGGACACCCGGGAGCATTTAACCCGTTCCTGCTGAATCCTGGCTGGTTGGACGCAGCATATTTAAATTACGCTTGGGCTGATTACTTCAGACCGCCACAGCTAAGAGATCAAGCTCTTATTAAAGGTGGAGCGGCGCCAATTACTACTCCTCCAGTTGCGTTACCAGGAGCCGAACTGTTTCCGTTTCCTCCAATGATGGGAACTTTACCTCCAGGTGGACTGATACCACCGCCAGGGGCATTTTTACCACCGAACGGGATGCTGCCATTTTCTCCTCATCCGGCGGACCTTGCACTGAAAAATTTACCGCAAGATATAAAAAATGGCGTCAGTCCTTATGACGCTCTAAGGCATTTCCAAAGCAACGTCCTATCGATAGACACTTCAAGCGCTCGATTGTCTCCAACAAGCAGTAGACAGAGCGGCAGCCCACGAAGCGTTATAAATACCAGTCCAAGGTCGAAAGCTGATTCTAAATCCGAAATTAATTCAACACACGAAGCAACAATATCCGACGAATCGGATGAGGAACAGATAGAAGTCGTGAAATCAGCTTTCCATCCCACGCGACCAGCGAACGTTGAGCTCCAGGAGATGAAGCAAGTGCAAGCTGCAGATTCAACCGTATCGGATCGACCTCGTGTACGTAATGAACTCAAAGCACCTTTACATCGTACTACGCGAGTTCTATCGACTAGTCCGACGTCGACGAAAATTTCCAACGGTGCTATATCAACGCATAAATCAGTATGGCGACCGTATTAA

Protein sequence:

>DPOGS200620-PA
MHLPHASPPTPTMADVYSHIHEYYRQTHGDLVQTGSPAVLCSALPGHWRSNKSLPVAFKVVALDDVQDGTIVTIKAGNDENVMAEMRNCTAVMKNQVAKFNDLRFVGRSGRGKSFSLTITISTFPSQVATYSKAIKVTVDGPREPRTKQNYGYGHPGAFNPFLLNPGWLDAAYLNYAWADYFRPPQLRDQALIKGGAAPITTPPVALPGAELFPFPPMMGTLPPGGLIPPPGAFLPPNGMLPFSPHPADLALKNLPQDIKNGVSPYDALRHFQSNVLSIDTSSARLSPTSSRQSGSPRSVINTSPRSKADSKSEINSTHEATISDESDEEQIEVVKSAFHPTRPANVELQEMKQVQAADSTVSDRPRVRNELKAPLHRTTRVLSTSPTSTKISNGAISTHKSVWRPY-