Monarch geneset OGS2.0

DPOGS209158
TranscriptDPOGS209158-TA1767 bp
ProteinDPOGS209158-PA588 aa
Genomic positionDPSCF300061 - 233613-241705
RNAseq coverage827x (Rank: top 16%)
Annotation
HeliconiusHMEL0097574e-13161.19% 
BombyxBGIBMGA011476-TA3e-12054.15% 
DrosophilaCG17168-PA9e-8054.58% 
EBI UniRef50UniRef50_Q9W5Q11e-7754.58%CG17168 n=9 Tax=Eukaryota RepID=Q9W5Q1_DROME
NCBI RefSeqNP_001015254.13e-7854.58%CG17168 [Drosophila melanogaster]
NCBI nr blastpgi|628622145e-7754.58%CG17168 [Drosophila melanogaster]
NCBI nr blastxgi|1953963374e-8838.21%GJ16691 [Drosophila virilis]
Group
Gene OntologyGO:00055154.2e-30protein binding
KEGG pathway 
InterPro domain[427-552] IPR0002534.2e-30Forkhead-associated (FHA) domain
[425-555] IPR0089847.7e-29SMAD/FHA domain
Orthology groupMCL12652 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209158-TA
ATGAAATCCAAAAAGCGTCACCACAATTCATCTGACGAGTCCGAAGAGTCTGACGGAAGTGTACAATCGAGTAGTTCATCAGATAGTGATAGTGATGATTCTTCGGTGTCGGAAACCCGCTATAAGAAAAAGCGAAAACGTGCCAACAACTCTTCCGAATCCGAAAGTAATTCTGATGACCAAAAGCGATCCAAAAAACGTAAGCATAAGAAGAAGAAGAAGAAACATAGCAAGAAGAAGAGGCGGTCGGATTCGTTAGAAGATGTTACCGACGATAATATTTCTATAAGCGAGGGTGAAATATCTGTGAAGAAAAAGCGGAAGCATAAACATAAGCACTCGAAACGGTCACAGACTTCTGGAAGCGAAGATGAGTTGGAACGGCATAATGAACGCCGGCTAATGAGTGTTGTTAAGGAAAGACGCGGATCTTCAGACGAGGGCAGCCAGCCTTCCCGCACTTACTATCAGAAAGAATATCACGGTTCAATGCATCCCAGAGATTATGATATTGATCAAAATTCTCAAAAGTATTACAAGGAACCAATGAGGGAACACGGCTATCAGGATTACCAGGACTATAGGGAAAGGGATGACAGATATCAAGATGAGAATCGTCAAAGGGAGAAATATGGTCACAGTTCACACAACGCTGATTACGGTGGTCAGAGAGAATACAATGTCTATAAAAGATCATCGAAATATGAGTCTAGACCGGATGAAGTCTACAGAAAAGAAAGGGACGATCCCTATGGTCCCAGAGAAGATATGAGAGGCCATCCGAAATATCCTCAGAGGTTTGATGAGAGACCTCCAAAGAGGTTTGATGACAACCGTGATTACCGTGACAGACGTGTACAGGAAGCTGAACGCTACAGAGAGAAGGCCTATGAAAAGAGAGAAAAATATGGAGGAGATGGAAATTCAGTGGAAAGAGAGGAATCAAGATCAAGAAGTCCGGATGAGCGCTACAGGGAAAAGAACAGAGATAGAAATGATCGTTACAGGGGGGAGGACACAAGGAGGGGAGAGGAGAGAGGGGAGGGGAGGAGGGTTGAGGAGAGGGGAGAGGGGCGGCGGGTTTATTATAATCTGTGTTGTAACGTTTATTTTACTGACAACGCTAATGATAATAAAGAATACACCTGGGGGAAGACGGAGGTGAAGAAGGAAGGGGCCAAGAATCCAGCTGATAAAGAGAAGCCTAACTTTGGATTATCAGGGAAGTTGACAGCAGACGCTAACACAGTGAATGGAGTGGTCATCAAATACACTGAGCCTGATGATGCAAAGCAACCCAAGAGACGCTGGAGGTTCTATCCGTTCAAAGGCGACAAGGCTCTCCCGATCCTGTACATCCATCGCCAATCCTGCTTCCTCATCGGCAGAGATAAAAAGGTCGTCGACATAGCCCTCGAACACCCATCCATAAGCAAGCAACACGCGGCGTTGCAGTACAGAGCGACTGCCTTCACCAGGGACGATGGCACTCAGGGGAGACGTGTCAGGCCTTATATCATAGATTTAGAATCGGCGAACGGCACGTTCGTGAACAACAAGAAGATAGAGGCCCGCCGCTACGTGGAACTGCTCGAACGAGACGTCGTCAAGTTCGGCTTTTCGGCGCGGGAGTACGTGTTGCTGCACGAGAACAGCAAGGACGAGGGCCAGGACGACGACCAGGAACCCGCCCCCGCCCTCACCACCGTCGACCAGCTGAAGAGGGAGAAGCACGCCAAGGAGGCGGCGGCCGCGGACGGGGAGTAA

Protein sequence:

>DPOGS209158-PA
MKSKKRHHNSSDESEESDGSVQSSSSSDSDSDDSSVSETRYKKKRKRANNSSESESNSDDQKRSKKRKHKKKKKKHSKKKRRSDSLEDVTDDNISISEGEISVKKKRKHKHKHSKRSQTSGSEDELERHNERRLMSVVKERRGSSDEGSQPSRTYYQKEYHGSMHPRDYDIDQNSQKYYKEPMREHGYQDYQDYRERDDRYQDENRQREKYGHSSHNADYGGQREYNVYKRSSKYESRPDEVYRKERDDPYGPREDMRGHPKYPQRFDERPPKRFDDNRDYRDRRVQEAERYREKAYEKREKYGGDGNSVEREESRSRSPDERYREKNRDRNDRYRGEDTRRGEERGEGRRVEERGEGRRVYYNLCCNVYFTDNANDNKEYTWGKTEVKKEGAKNPADKEKPNFGLSGKLTADANTVNGVVIKYTEPDDAKQPKRRWRFYPFKGDKALPILYIHRQSCFLIGRDKKVVDIALEHPSISKQHAALQYRATAFTRDDGTQGRRVRPYIIDLESANGTFVNNKKIEARRYVELLERDVVKFGFSAREYVLLHENSKDEGQDDDQEPAPALTTVDQLKREKHAKEAAAADGE-