Monarch geneset OGS2.0

DPOGS214939
TranscriptDPOGS214939-TA2298 bp
ProteinDPOGS214939-PA765 aa
Genomic positionDPSCF300280 - 139395-154845
RNAseq coverage429x (Rank: top 28%)
Annotation
HeliconiusHMEL0155942e-14870.77% 
BombyxBGIBMGA004821-TA0.077.97% 
DrosophilaMTA1-like-PA0.052.44% 
EBI UniRef50UniRef50_D6WR600.055.89%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WR60_TRICA
NCBI RefSeqXP_975498.20.055.89%PREDICTED: similar to MTA1-like CG2244-PB [Tribolium castaneum]
NCBI nr blastpgi|1892394910.055.89%PREDICTED: similar to MTA1-like CG2244-PB [Tribolium castaneum]
NCBI nr blastxgi|1892394910.056.05%PREDICTED: similar to MTA1-like CG2244-PB [Tribolium castaneum]
Group
Gene OntologyGO:00036772.7e-17DNA binding
GO:00055152.9e-12protein binding
GO:00063551.1e-06regulation of transcription, DNA-dependent
GO:00435651.1e-06sequence-specific DNA binding
GO:00082701.1e-06zinc ion binding
GO:00037001.1e-06sequence-specific DNA binding transcription factor activity
KEGG pathway 
InterPro domain[52-122] IPR0010252.7e-17Bromo adjacent homology (BAH) domain
[231-287] IPR0090572.9e-12Homeodomain-like
[230-279] IPR0010051.2e-07SANT domain, DNA binding
[357-393] IPR0006791.1e-06Zinc finger, GATA-type
Orthology groupMCL10660 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214939-TA
ATGAGGGAGAACAATCGAAAATATCAGGAGCTTCGCGATTTACCACCGGGTACCATCGGCGGTCGCGCGCATGCCTGTTTTCTTAGGTCGCGCCCGACGCTGCCTTCCGATGTGGCAATGGAGGAGGAATCTACGGAGCTGCCTGGTTCTGACGGGCTCGCGCCCAAACAGCGGCACCAGGCGAAGCAGCGCGAGCTGTTCCTGTCGCGGCACGTGGAGACCCTCCCAGCCACGCACATCCGCGGCAAGTGTACCGTCACGCTGCTCAACGAGACGGAGTCGCTGCTCAGCTATCTCAATAAGGATGACGCATTTTTTTATTGTTTAGTATTTGATCCTTCACAAAAGACTTTATTAGCAGATAAGGGAGAAATCAGAGTTGGAAGTAAATATCAGACTGAAGTAACTAATTTATTAAAAGAAGGTGAGATGATTTCTTTAACTAGTTATGATGAAAGTAACAAGATCGACCAATTCCTGGTGGTGGCTCGGTCTGTGGGCACCTTCGCCAGGGCATTGGACTGCAGCTCCAGTGTTAAACAGCCCTCGCTACACATGTCCGCGGCGGCCGCCAGCAGGGACATAACTCTTTTCCACGCCATGGACACGCTGCACAAGTCCGGGTACAGCATAGAGGCTGCTCTGTCGTCGCTGGTGCCGGCCTCCGGGCCTGTGCTGTGTCGCGACGAGATGGAGGAGTGGTCGGCCTCAGAGGCCAACCTGTTCGAGGAGGCGCTCGAGAAATACGGCAAGGACTTCGCTGATGTACGGAAGGACTTTCTGCCGTGGAAGACGCTGAAGAATCTGGTGGAGTACTACTACATGTGGAAGACGACGGATCGCTACGTGCAACAGAAACGGGTGAAGGCTGTGGAGGCGGAGTCCAAGCTGAAGCAAGTGTACATTCCCAATTATAACAAACCGAATCCAGCGTTGTTGTCGAGCGGCGCGGCGGCTATCACGAGCGCGGCGGCCCCCCCGCCTCCGGGGCCCCGCCCGGCCGGCGTCGCCAACAAGGGAGCCGTGCTGAACGGAGGAACCAACGGCACAGCGGCCGCACCCACCATGTGCGCCTCGTGTCAAGTGACAAATTCAAACCAGTGGTACGCCTGGGGACCACAGCATTTACAGTACAGATTATGTGGCGCTTGCTGGCAGTACTGGAAAAAATATGGAGGACTTAAGACGGCGGGAGTGTTCGGCGAGAGCGAGGCGGAGGCGGGGCGCGGGGTGCGGGCGGAGGCCGACGACACAGCACTGTCCGTGTCGCACAGACCGCACCGGTGCTCCGTGGTTAACTGCGCCAAGGAATTTAAACTGCGCGCTCACCTGGCCCGCCACATGGCGACTGCTCACGGCGGCGCGGGCGAGGGCGCTCGGCCCGTCATGAAGACCCGAGCCGCCTTCTATCTCCGCGCCTCGCCCTTCACGAGACTCGCGCGCCGCCTCGCCCGCGCCCTCCGCAGACCCAGGCACTACGCGCGCTCACCCTTCTCACCGATCAACCTGCACCAGGTCAAACACGAGTGTACGATAGCGATGGCGGGCGGCGTCGGTGGTGTGGGCGGTGTGGGCGGCGTGGTCCCGGCGGAGGTCCGAGGCGTCGCTCGTGCCCGCGGGCCCGTGGGCGCGGTAGCGGCCCGACTCGCCGCCGCTCTGGGCACGTCCGCGCCTCGAGCCCAGGACTGGCTCACCCTCACCCCGCGCGAACGTCTGCCCACACCCAACCACGTCGCCTTCCCCAAGCCGCCCAAGGCCCCAGATGGCAGCCTCATGTACGAGCGTGTGGTGTCCCGCGCGGAGCTGGAGGCGCGCCGCAGCGAGGCGGCCGCGCCGGCTCTCAAGCGGCGCGCCTACGACGACATCAACGGCCTCGACAGAGGTTGTGGTGGTAGCGCGCCTCCCGCCAAGCGACCCAACAAGCATCCGGCGCCCATGCAACGTCCATCACGCGAACAGTACGCGGCCATGTGCGCGCGAGCCCAGGCCACGGGACAACCTCTGCCCGCACACGTTTTTGCACACGTGAACGGCAAACCGACGAACCTGACCGGCCGCGGCGGTCGTCGCCACGTGATCTCGTGGATGGACGCTCCGGACGACCTCTACTTCAGAGCCACCGAGACCGCCAAAGCCGCCCGACGGACGCTGAGCTGCGGCGAGCTGAGACGCGGCGCCCGCGCTCCGTGGCGCGTGATGCGCGGGGCGGTGGCCGGCGTGGTGCTGGGCGCGGCGGCGGCGGCGGGCGGCAAGGCGGGCGCCGCCTCCGCCCCGCTGCAGCTGGTGATCCTCGACTGA

Protein sequence:

>DPOGS214939-PA
MRENNRKYQELRDLPPGTIGGRAHACFLRSRPTLPSDVAMEEESTELPGSDGLAPKQRHQAKQRELFLSRHVETLPATHIRGKCTVTLLNETESLLSYLNKDDAFFYCLVFDPSQKTLLADKGEIRVGSKYQTEVTNLLKEGEMISLTSYDESNKIDQFLVVARSVGTFARALDCSSSVKQPSLHMSAAAASRDITLFHAMDTLHKSGYSIEAALSSLVPASGPVLCRDEMEEWSASEANLFEEALEKYGKDFADVRKDFLPWKTLKNLVEYYYMWKTTDRYVQQKRVKAVEAESKLKQVYIPNYNKPNPALLSSGAAAITSAAAPPPPGPRPAGVANKGAVLNGGTNGTAAAPTMCASCQVTNSNQWYAWGPQHLQYRLCGACWQYWKKYGGLKTAGVFGESEAEAGRGVRAEADDTALSVSHRPHRCSVVNCAKEFKLRAHLARHMATAHGGAGEGARPVMKTRAAFYLRASPFTRLARRLARALRRPRHYARSPFSPINLHQVKHECTIAMAGGVGGVGGVGGVVPAEVRGVARARGPVGAVAARLAAALGTSAPRAQDWLTLTPRERLPTPNHVAFPKPPKAPDGSLMYERVVSRAELEARRSEAAAPALKRRAYDDINGLDRGCGGSAPPAKRPNKHPAPMQRPSREQYAAMCARAQATGQPLPAHVFAHVNGKPTNLTGRGGRRHVISWMDAPDDLYFRATETAKAARRTLSCGELRRGARAPWRVMRGAVAGVVLGAAAAAGGKAGAASAPLQLVILD-