Monarch geneset OGS2.0

DPOGS203076
TranscriptDPOGS203076-TA1074 bp
ProteinDPOGS203076-PA357 aa
Genomic positionDPSCF300294 + 251-4614
RNAseq coverage1x (Rank: top 93%)
Annotation
HeliconiusHMEL0041652e-14671.47% 
BombyxBGIBMGA001852-TA2e-13680.14% 
DrosophilaXNP-PB5e-7349.23% 
EBI UniRef50UniRef50_D6WPP96e-7350.38%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WPP9_TRICA
NCBI RefSeqXP_001810058.13e-7347.87%PREDICTED: similar to transcriptional regulator ATRX (X-linked helicase II) [Tribolium castaneum]
NCBI nr blastpgi|3838633687e-7354.27%PREDICTED: uncharacterized protein LOC100874907 [Megachile rotundata]
NCBI nr blastxgi|2700104031e-7047.87%hypothetical protein TcasGA2_TC009794 [Tribolium castaneum]
Group
Gene OntologyGO:00036774.1e-44DNA binding
GO:00055244.1e-44ATP binding
KEGG pathway 
InterPro domain[35-277] IPR0003304.1e-44SNF2-related
Orthology group 
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203076-TA
ATGCCCGAAACAAGGCCACGTGTCCGGAACAAGAGACGCAGCAACAAGAACAAAGTCAAAGACAAGAAATCGTTGGCGCGTAGCACTGTGGTAGCAAACATGGCGGAGTTCGAGCGCAAACGGAGGCTAACGATACGCCAGGCACAACTAGTATTTGAATTGTCAAAACTGAAGAAGACATACGAACGAGCATATCAGTTGGAGGACTGGTATAATGGAGGTGGGATTTTTATCATTGGATACGAACTATTTAGGAGTCTGAGTACTTTGGACCCTGTTCTTGACGGTGTGAGGCCCAAAGTCCTGAACAAAATCCGCACAGCGCTGTTGGATCCTGGTCCGGATATCATAGTTTGTGATGAAGGTCATTTGTTGAAGAACGACTGCTCTATACTGGCGGTAGCCATGAGTAGAGTGGTCACTAAGAGACGTATCGTACTCACGGGCACTCCCATGCAAAATAATCTCAGGGAATATTACTGCATGGTGAATTTCGTTAAACCGAATTTATTAGGATCTTACTCGGAATACTCCAACAGATTCGAGAATCCGATTATGAACGGGCAGCACAGGGACTCGAGAGAGGAAGATATAAAATTGATGAAGGCTCGTACACATATTCTTCACAAAGTGTTAGAAGGGTGTCTCCAGCGTCAGGAGGCGTCCGTGCTGTATCCGTACTTGCCAAAGAAATACGAGTACACTGTGTTCATATCACTGACGAAGTGCCAGTGGGAACTGTACAAACATTACCTGACGCATTACGCGAAGGACACCAAGCAGAGTGTGCTGAGAGACTTCCACGTGCTACAGAAGGTGTGGACCCATCCTCAAGTGCTACATAACTTCTTGACGAAGACGCGCGCTGACGAAAAGGAACCCAAAGTCAAAGTGGAGAAGATAGAGAAGTTGGACGATGATCTCGAAGAGTCTCCCGAGCACGTGGCCGCGGCGGCCGAGTGGTGGGCGAGCACACAACACAGACACGAACTGAACGAACTGGATACCAAAAAATATAAAGTTAAATACAAACAATTATATATACGTATATATATATATAAGAGCTGTAGGTAA

Protein sequence:

>DPOGS203076-PA
MPETRPRVRNKRRSNKNKVKDKKSLARSTVVANMAEFERKRRLTIRQAQLVFELSKLKKTYERAYQLEDWYNGGGIFIIGYELFRSLSTLDPVLDGVRPKVLNKIRTALLDPGPDIIVCDEGHLLKNDCSILAVAMSRVVTKRRIVLTGTPMQNNLREYYCMVNFVKPNLLGSYSEYSNRFENPIMNGQHRDSREEDIKLMKARTHILHKVLEGCLQRQEASVLYPYLPKKYEYTVFISLTKCQWELYKHYLTHYAKDTKQSVLRDFHVLQKVWTHPQVLHNFLTKTRADEKEPKVKVEKIEKLDDDLEESPEHVAAAAEWWASTQHRHELNELDTKKYKVKYKQLYIRIYIYKSCR-