Monarch geneset OGS2.0

DPOGS213534
Transcript: DPOGS213534-TA, 2517 bp
Protein: DPOGS213534-PA, 838 aa
Genomic position: DPSCF300033 - 589394-599449
RNAseq coverage: 423x (Rank: top 29%)
Annotation
Heliconius: HMEL007901, 0.0, 65.76%
Bombyx: BGIBMGA011814-TA, 0.0, 61.47%
Drosophila: Mrtf-PE, 1e-65, 69.23%
EBI UniRef50: UniRef50_D6WGN4, 1e-69, 54.24%, Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WGN4_TRICA
NCBI RefSeq: XP_973061.2, 1e-71, 57.41%, PREDICTED: similar to Myocardin-related transcription factor CG32296-PA [Tribolium castaneum]
NCBI nr blastp: gi|189235271, 3e-70, 57.41%, PREDICTED: similar to Myocardin-related transcription factor CG32296-PA [Tribolium castaneum]
NCBI nr blastx: gi|328710979, 3e-96, 40.75%, PREDICTED: hypothetical protein LOC100161677 [Acyrthosiphon pisum]
Group
Gene Ontology: GO:0003676, 5e-13, nucleic acid binding
KEGG pathway:
InterPro domain: [423-455] IPR003034, 5e-13, DNA-binding SAP
InterPro domain: [190-214] IPR004018, 7.1e-09, RPEL repeat
Orthology group: MCL25889, Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213534-TA
ATGATAATTGGAACGAGTGATCTATTAGGTCCGTGGGCGGGAGGCGCCCACGTGGTCAGTGTCGCGTGCACTCGTTACGTCACCCCTAAAGCCTTCGAACAGAGCCGTTCAACGTGCGGCGGTGGCGGTGGCACCGAGGGGTCTGAGGGGCGCGCGTCGTGGCGCGTCACGTTGGACCACGACCTTGTGGAGACCTTGAGCGCCATCTACCCCGAGTGGTTCAAGCAGCCGCGGAGGGAGCGCTCCTCGCCCCTCTCGCCAGCCTCGCCCTGCACACCCCCTGAAGACGACACCTTCAGTTCGGGCGCGGAGGAGATGGCCGGTCGTCAAGCCTCTAACAGCGGTACGAGCGAGCCCAGCTCGCCGCCGCGACTTTCCCCGCCCAAGGTTGTAGTTGACGACAGCCCGTTACAACCGGCCATGGACAAACATAAAGAATCGCTTAAAGTGAAGCTGATGAATCGCCGCTCGTTCAACCAGCTGGTGCAGCAGGGCATCATGCCGCCGTTAAAAACACCGCCGGCTTATTTTGAACGAAGGAAGCAATTAGAGAGAGCTAAGACGGGCGATTTCCTTAAATCCAAGATTCAAAGTAGACCGGATCGACAGGAACTGGAGCGTAGACATATCTTGGAACAGGAAAGTCACGTGGATCCGAGTTTGGCCGAAAAGCAGCGAATGTTAAAAAAGGCGCGGCTAGCCGATCAGTTGAATAATCAAATATCTCACCGACCGGGGCCGCTGGAGCTGATTAAGAAGAACATCCTGCATACGGAGGAGAATATTGAAACTGCCGTGAAAAGTGGCATCCTTCCATTCAAGGCTACCAGTGAGGGTGCGTCTGGCCGGCCCCAGCTCCCCTCGTCCTACTACGGTCCGCCGGAAGAGGTGTCTCCATCCCCTTCCCCGCCGTCTACACTATCGCCTGTGAGCATCACGCTCGCCAGTGAGAGCACGCCGCAACAGCATGCGGCACCCGGCAAGGAACCTAAGAAAAGGAAACCAAAATTGAAGCAACAAATAAAACCACGTTACAAATTCCATGAATACAAAGGTCCGACGAATGCCCAGACTGCGCCCTCGCCTTCCGGGTCTATGGAGACCCCGTACGAGCTTCTACTGCAGCAGCAGCAGTTGCTGTTGCAGCTAATGCTGCCCGCCTCGCCGGCGCCGTCGTCGGCGTCCATAGCGTCGGATTCGTCGGACGCTCTGCCACCGCCGCCGCCGCCCCTACCGGCCTCTACGTCTCTGGTAGCGGCTCGCTTTGAAGAGATGAAAGTGTCCGACCTCAGAGCCGAGTGCAAGAGGAGGAACTTGCGGGTGTCCGGACCTAAACCGCAGCTGATAGACCGCCTTCAGAACTTCCACAGGGAAATGCAAGAGGAGACTCCTCGCTCCCCGGCTTCAGTCGCGTCGCCGGAGTCGAGGGCTGAGTCAGAGCCTTCAGAGGACATAGTGCAGTCGCAGCGGCGACTCATAGAAGAGTTGGAGAGACAATTAGAAGAGTCACGACAGCAGCTGGAGGCGGTGCGGCGGGAGGCAGCGGGAGCAGCAGCTAGCGACCACAGCCGCCGTTTACTTCACGCATATGTATGCGTCACCAAGCTGCGAGCCAAGCTTGATGCGCTGCAGCAGCCGGCCCCCGCGCCGGCGCCCCCCGCCCCGCGGTATGTGCTCGCCGCTCCTGACACTACACAGGATCGTCTCGTCTTCACGGTGACGTCTCCGTCTTCCAATGAAACCTCTGACGTCACACCTACGAAACCTGTCAATCCGGCGTATATCTTGAACGGTGTTAAAGTGGTCCCGATAGCCATCCTCCCCACTGCACACTATGAACCCGAGCGCGCCGTCCCCCCACCACCGCCTCCGCCACCACCATTACCGCCTTTAACTCAGCAGACTACCACCACCCTGCATGATAACGCCGAAGACAGTCAAATTATGAACGATGTCTTAGAAATTCTTGTAGAAAATGGTGAATTACCGCCGTCTGCTGT
CGGCGACGTGTCCTCGAACAGATCGATCGACACCGGCTACCTCACGGCCGGGTCGGGGGACTACACGCCCACGGATCTGGCTAACAATGAGTTCAACGCCAACTTCTGCAACTCGGACTCCATATCCAGCCATCACGACGACTTCCTGTCCGGGTTCCCCTCCAACGCCATGGACATAGACGACGAGGTCCCGGAGAGGATGTCCATGACGCCGCTCACCAACGGCGACATGTTGGACCACACGCGCACCGGCTTCGGCGACATGAACGATCTGTCGCTGCCGTGCTTCAACGACGAGCGGCCGTGCGTGTTCGACTTTAACGAGTACGAGCTCGGGGCCGACATGAACGTGGACGACCCTTACGAATATATGAGCACGGAGCTCATACCCAACGTGTTCGGCAGAGGCCACGTGCCCCAGTCAGACCCGGTGCTGGGAGGAGTGCTGCGACCTGCGGCCCCTCGTCCGGCCAGCAAGCACTACTCCTGGGACAAGATCGAGTACGACGCCACTTGA

Protein sequence:

>DPOGS213534-PA
MIIGTSDLLGPWAGGAHVVSVACTRYVTPKAFEQSRSTCGGGGGTEGSEGRASWRVTLDHDLVETLSAIYPEWFKQPRRERSSPLSPASPCTPPEDDTFSSGAEEMAGRQASNSGTSEPSSPPRLSPPKVVVDDSPLQPAMDKHKESLKVKLMNRRSFNQLVQQGIMPPLKTPPAYFERRKQLERAKTGDFLKSKIQSRPDRQELERRHILEQESHVDPSLAEKQRMLKKARLADQLNNQISHRPGPLELIKKNILHTEENIETAVKSGILPFKATSEGASGRPQLPSSYYGPPEEVSPSPSPPSTLSPVSITLASESTPQQHAAPGKEPKKRKPKLKQQIKPRYKFHEYKGPTNAQTAPSPSGSMETPYELLLQQQQLLLQLMLPASPAPSSASIASDSSDALPPPPPPLPASTSLVAARFEEMKVSDLRAECKRRNLRVSGPKPQLIDRLQNFHREMQEETPRSPASVASPESRAESEPSEDIVQSQRRLIEELERQLEESRQQLEAVRREAAGAAASDHSRRLLHAYVCVTKLRAKLDALQQPAPAPAPPAPRYVLAAPDTTQDRLVFTVTSPSSNETSDVTPTKPVNPAYILNGVKVVPIAILPTAHYEPERAVPPPPPPPPPLPPLTQQTTTTLHDNAEDSQIMNDVLEILVENGELPPSAVGDVSSNRSIDTGYLTAGSGDYTPTDLANNEFNANFCNSDSISSHHDDFLSGFPSNAMDIDDEVPERMSMTPLTNGDMLDHTRTGFGDMNDLSLPCFNDERPCVFDFNEYELGADMNVDDPYEYMSTELIPNVFGRGHVPQSDPVLGGVLRPAAPRPASKHYSWDKIEYDAT-
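
The transcript and protein records above can be cross-checked against each other. The following is a minimal sketch, assuming the standard genetic code applies and that the trailing "-" in the protein record marks the stop codon; the helper name translate is hypothetical and is not part of the database. It translates a coding sequence codon by codon, and the short fragments used here are copied from the start and end of DPOGS213534-TA.

```python
from itertools import product

# Standard genetic code, codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence; stop codons are written as stop_symbol.

    Whitespace and line breaks are ignored, so a sequence wrapped over
    several lines can be passed in directly. Codons containing symbols
    other than A, C, G, T are reported as "X".
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        protein.append(stop_symbol if aa == "*" else aa)
    return "".join(protein)


# First 24 and last 12 bases of the DPOGS213534-TA record.
print(translate("ATGATAATTGGAACGAGTGATCTA"))
print(translate("GACGCCACTTGA"))

# Length check: 2517 bp is 839 codons, i.e. 838 residues plus the stop codon.
print(2517 // 3 - 1)
```

Passing the full nucleotide record to the same function should reproduce the DPOGS213534-PA sequence if the assumptions above hold.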