Monarch geneset OGS2.0

DPOGS200421
TranscriptDPOGS200421-TA2550 bp
ProteinDPOGS200421-PA849 aa
Genomic positionDPSCF300236 - 260174-272899
RNAseq coverage15x (Rank: top 81%)
Annotation
HeliconiusHMEL0037260.073.01% 
BombyxBGIBMGA008996-TA0.074.52% 
Drosophilaham-PB9e-6744.06% 
EBI UniRef50UniRef50_E9J3V01e-13141.51%Putative uncharacterized protein (Fragment) n=2 Tax=Myrmicinae RepID=E9J3V0_SOLIN
NCBI RefSeqXP_002430717.17e-12940.16%conserved hypothetical protein [Pediculus humanus corporis]
NCBI nr blastpgi|3227858875e-13141.51%hypothetical protein SINV_13327 [Solenopsis invicta]
NCBI nr blastxgi|3504274875e-16643.35%PREDICTED: LOW QUALITY PROTEIN: transcription factor hamlet-like [Bombus impatiens]
Group
Gene OntologyGO:00036763.3e-11nucleic acid binding
GO:00082703.5e-07zinc ion binding
GO:00056223.5e-07intracellular
KEGG pathwaytgu:1002306621e-64 
 K04462 (EVI1)maps-> Pathways in cancer
    MAPK signaling pathway
    Chronic myeloid leukemia
InterPro domain[708-735] IPR0130873.3e-11Zinc finger, C2H2-type/integrase, DNA-binding
[708-730] IPR0070873.5e-07Zinc finger, C2H2
Orthology groupMCL15906 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200421-TA
ATGGGGTGGTCAATGATAAATGCCCAAGCTGTTATCCCGCAGACACTGACAGATACTAATAGTGCCATCTATCGTGAAGCCGCGCTGTCCGCTGTCAGAACTCGCCAGGTGCCGCAAGATGACGCCACCGCTGTTTTCGGTAAAAACGGCGTTCGTTCGTGGCTGGACGCGGCTGCGGACAAGTCTAACTGGTTCAAACTCGTCCGCTGCGCAACCTCTCCGCACGAAGTCAATCTGCAACACGAAAAGTTTGCAGGACAAGTCTGGTATAAAGTGACTCGTGACGTGTCAGCAGGACAAGAGCTGTTGGTCGGAGCTTGGACGTCACTGCCGTTACAAGATGTTCTCACAACTGGTAGAGAGAGTGCCAGCAGTCACTCTCACCAGCAACAGGACGAAGAAGACAGAGAGGATACAAAACCACGATGTTCCTTCTGTGACGAACCATTCCCTAATATTGATGCACTTGACCGTCACTTGATTCAAGCACATGCCCAGCCAGCTTCGGCATATCATTGCGAGCTGTGCAACAGAGCGTACAGTTCCCGGGCACTTCTCCTAAGACATCGGGCGTTAACACATACCGATATCAGGAAATATCCCTGCGAGAATTGTCCTAAGGTATTTACCGATCCTTCCAACCTCCAGCGCCACATCCGCGCGCAGCACGTGGGTGCCCGCAGCCACGCCTGCCCTGAATGCGGCAAGACCTTCGCTACCAGCTCCGGCCTTAAGCAGCACACACATATTCACTCCAGTGTCAAGCCCTTCCAGTGCAAAGTCTGCTTCAAGGCATACACTCAATTCTCTAATTTGTGCAGACACAAACGAATGCACGTTGCGTGTAGAGCATTGGTAGAGTGTGGGAAATGTGGACAATCATTTACGTCGTACGCATCTCTTACTAAACATAAAAGATTTTGCGATACTGCTTCTGCAACGAACGTAAATCTGAGAGGACAAATTGGTCAAGGATTACCGCAGATACCACCTATTCCAAACGTCATGAATAATCCGAATAATACAAATCCATTCCCCATGTACAGAGGTCCAGCTCTGCCGTTACCATACAACACTTTCGCACATTACCCAGCCTTTATCTCCGCTGCTGCCGCCGCAGCTTGCCCTCCAGACTTTTTAAGTCCCCTCCTCTTCAATGTCCAAGGAGCGAGGTTAGCTATGGAGCATGATTTGGCACTTAACGCCAGTTTAATGGCCAAGCAACAACAGGAAGAACGCATGTCAGTAAAAAAGGAAACAGAGAGCATAGATAGTTCAACATCTGTAGATATAATAAATAAAGCGAAAGAAATTACCAGAGATGAGAAAGATATGGATGTAGACAGAGTAACACCGAAACCTCAGGAACATTATGTAAAACAATCACCGCCGTCGGCTGAAGAGGCTACTTCAAAACAACGTCCTTCTCCAGTGATGCCACTGTCGACTACTGTTGGACCCTTTGATTTTGCAAGAAATGAAACAAAACACAACTCTATGTACGATTTTTCTTTGAAAAATAACAACGAAACTTTAGAAAATAAGTCAATGTCTCCTCAACCTAAAGATTTGACGAGGAACAACATGTCCAGTGATGTGGAAAAACAATCAAGATATTCTAATTTAGAAGAAGAAATAAAAGAGCAGAATGACCAACCGCTCGACTTATCCGTCACTCGAAAACAACGTGACAAGGAGTCAGACCTAGAAAATGATGATCATTCCTTTCGAAATTCATCGATTAAATCTTATTCACCTGCTGAAAGTCCTGTTGATAGAGAGAATAAGACTCCGGAAAATGAGACAACTGATGTTGACGTGGAAGCAGTTGAACCCAAAAGAGAGGATTCCCCAGTATCAATGATGTCTCCTCCCTTAGCATTCCCAATGGCTGTACATGCTCAGCACAATAACAGTCTCATGAACGCAATGTACCCACCACGTTTTACACGTTTCCATTCGACTTCTGACTCCATACTAAGCGCACAGCACTCACCATACGTTCCCAGCCCGTTTAATTTTTTATCGCCACTTCTCGGCACTGATGGCCCCGATAGGCAATCAAGTGCCTATGCGAAATTTCGAGAACTTAGCGCTGGTTCCGGCAAACTGCGAGATCGCTACGCTTGCAAATTTTGCGGAAAAGTATTTCCGCGAAGTGCCAACTTAACGCGTCATTTACGTACGCACACCGGCGAGCAACCATACAAGTGCAAATATTGTGAGCGTTCCTTTTCCATATCCTCTAATTTACAGCGACACGTAAGAAACATTCATAACAAAGAGAGACCGTTTAGATGTCAGTTATGCGATAGATGTTTCGGTCAGCAGACTAACCTAGATCGACACCTTAAGAAACATGAGGCGGAAGGTGGTGATTCACCAAGTTCCGGGGATACTGAACACGACGCGTGTTTTGATGATATTCGTTCTTTCATGGGGAAGGTGACCTGTTCTCCTGGAGCAGGATCCCCAGCAGCGACTTCTCCTCACCCATCTCACGCCCCACATCCTTCTCATCGACCTTCAGCGCTTTCCATTTCCACCTAG

Protein sequence:

>DPOGS200421-PA
MGWSMINAQAVIPQTLTDTNSAIYREAALSAVRTRQVPQDDATAVFGKNGVRSWLDAAADKSNWFKLVRCATSPHEVNLQHEKFAGQVWYKVTRDVSAGQELLVGAWTSLPLQDVLTTGRESASSHSHQQQDEEDREDTKPRCSFCDEPFPNIDALDRHLIQAHAQPASAYHCELCNRAYSSRALLLRHRALTHTDIRKYPCENCPKVFTDPSNLQRHIRAQHVGARSHACPECGKTFATSSGLKQHTHIHSSVKPFQCKVCFKAYTQFSNLCRHKRMHVACRALVECGKCGQSFTSYASLTKHKRFCDTASATNVNLRGQIGQGLPQIPPIPNVMNNPNNTNPFPMYRGPALPLPYNTFAHYPAFISAAAAAACPPDFLSPLLFNVQGARLAMEHDLALNASLMAKQQQEERMSVKKETESIDSSTSVDIINKAKEITRDEKDMDVDRVTPKPQEHYVKQSPPSAEEATSKQRPSPVMPLSTTVGPFDFARNETKHNSMYDFSLKNNNETLENKSMSPQPKDLTRNNMSSDVEKQSRYSNLEEEIKEQNDQPLDLSVTRKQRDKESDLENDDHSFRNSSIKSYSPAESPVDRENKTPENETTDVDVEAVEPKREDSPVSMMSPPLAFPMAVHAQHNNSLMNAMYPPRFTRFHSTSDSILSAQHSPYVPSPFNFLSPLLGTDGPDRQSSAYAKFRELSAGSGKLRDRYACKFCGKVFPRSANLTRHLRTHTGEQPYKCKYCERSFSISSNLQRHVRNIHNKERPFRCQLCDRCFGQQTNLDRHLKKHEAEGGDSPSSGDTEHDACFDDIRSFMGKVTCSPGAGSPAATSPHPSHAPHPSHRPSALSIST-