Monarch geneset OGS2.0

DPOGS214455
TranscriptDPOGS214455-TA1653 bp
ProteinDPOGS214455-PA550 aa
Genomic positionDPSCF300441 + 5774-10271
RNAseq coverage1317x (Rank: top 10%)
Annotation
HeliconiusHMEL0044392e-5539.40% 
BombyxBGIBMGA009574-TA2e-1753.42% 
Drosophila% 
EBI UniRef50%
NCBI RefSeq%
NCBI nr blastp%
NCBI nr blastxgi|1543320733e-1728.89%microtubule-associated protein [Leishmania braziliensis MHOM/BR/75/M2904]
Group
Gene OntologyGO:00036771.2e-07DNA binding
GO:00063551.2e-07regulation of transcription, DNA-dependent
GO:00055157e-07protein binding
KEGG pathway 
InterPro domain[375-428] IPR0150109e-10Rap1 Myb
[374-429] IPR0122871.2e-07Homeodomain-related
[371-432] IPR0090577e-07Homeodomain-like
Orthology groupMCL35021 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214455-TA
ATGTCCGGAGTAAAGTTCTATGTTCTGGGAGGTGGACGATGGCGGGAGGTGGATATGTCGTCTGGTCTTGTGGAGGGCTGGGGGCCAGCGCGGTTCCTGGTGGCCCGCGGGGGTCGGCTAAGGGCTCAGCGGCGAGCCCCCGTGTTGAGAGGGTTACGGGCACTGGCGGTTGGCAGGCCTACCACCGTACATGATGGACCGACGCTCTCTCAGATCAGATACGACACGGTCTCCAAACCGAAGCCCCGTCCCGCCCGCCGGCAGCTCTGCGACCCCTGTCACGTCCCATGCCACGCTCTGTGTCGTCCGCAACACTACGGGTGCGGTCACAACCCGGTGTACAACGAACAGGCGTTCGCACTCCCCGCGGCCGGTCCGCGAGTCTCACACGCCTCGGTACAGGCTGACCTCACACCAGACAGGCAGAGCAACCAAGATGGCCGACACAAGACCGAGGAGGCGACGCGGAGAGACAAGGCCGACACCTCACACGGAGCAAAGAACACCGAGATCAAGATCACTCACAGGACACGACATGACCGACGGCAGACTCGGGCGCGTAGCGACAGGAAGACGTGTCACTCGGAGGAGGAGCCGGAGAGCAAGAGGAAACGGAGGGAGGGAGGAGGGGGCGGAGTGTGCAAGGAACTGATCATTACCGAGGAGGACGTGCTGCTGTTGAGAGACTTCCTAGTGAAGGACGACAAGAACGTGTTTCAAGTAGACCTCGAGAGCTTCGACGCCGACGACGCATACAAGAGGCGCGTGTTCTACAGACTGAACCCGCTCGTCAGCATGGAGCGGTGTGCGCTCACCGACACGAGGCGGGGGGAACGAGCCAGCCCCTCCACCGTCACGCTCAAGGACTTCATGTCCAACCTCGGCTTGAAGGAGGTGGAGTCGGAGCAGCACACGAGCTCCTCCTACAGGACGAGGAGGAGGACTCGGGGCGACGGAGACGACAAGGAGAACAGGGACAAGGACATTCGCACCAAGACACGGGACGGAGCGGCCGGGGTGTTGAGAGACGGCTCGGGCAAGACCAACAGAGGCACGGACAGCAGACACGACCGGAGACAGGCGGCCGAGCACGTGAACAAGCGCGGCAGGGCGTGCCGGGAGTACAGCGCGGCGGAGGACGCGGCCATCGTGCGGCTGGTGTGTGACGGAGCGAGGGGCGCCAGGGTCAACGGCAACACGCTCTGGCGGGAGCTGCAACACGACCACCTGCGGCTGACGGGACACGCCCGGTCCTGGCACTCGCTCCGTAACCGCTACCTCCGCTACGTGCTGCCGTCGCTGTCGTCGCTGGTGTCTCCGAGCGTGGCGTCCCGGCTGCGCGCCGCGGCGGCCGCGGGGGAGATAAAGCGCGGGTCGCGGGCGCGGCCGGCCGACAGCTCGTTCCACCGCGTGCCGGCGTCGATGTATCTGTTTGTGTATAATGTAGGTACGGTAGTGCTCCGCTCCCGCGCCGTCCCGCGCGTGTCCCGCACGCCGACCTACGACGAACTCACGAGGAGATTCAACGAGCGACACGCACCCGACTCGGACTCGTCGGTGACGGAGCGCCGCGCCTCCCGGAGACTGCGCGCCGACCGCGCCGCAGCCCCGCATAAGGCGCACGCGAGGCGCCTCTACAGCCACGCCCAATGA

Protein sequence:

>DPOGS214455-PA
MSGVKFYVLGGGRWREVDMSSGLVEGWGPARFLVARGGRLRAQRRAPVLRGLRALAVGRPTTVHDGPTLSQIRYDTVSKPKPRPARRQLCDPCHVPCHALCRPQHYGCGHNPVYNEQAFALPAAGPRVSHASVQADLTPDRQSNQDGRHKTEEATRRDKADTSHGAKNTEIKITHRTRHDRRQTRARSDRKTCHSEEEPESKRKRREGGGGGVCKELIITEEDVLLLRDFLVKDDKNVFQVDLESFDADDAYKRRVFYRLNPLVSMERCALTDTRRGERASPSTVTLKDFMSNLGLKEVESEQHTSSSYRTRRRTRGDGDDKENRDKDIRTKTRDGAAGVLRDGSGKTNRGTDSRHDRRQAAEHVNKRGRACREYSAAEDAAIVRLVCDGARGARVNGNTLWRELQHDHLRLTGHARSWHSLRNRYLRYVLPSLSSLVSPSVASRLRAAAAAGEIKRGSRARPADSSFHRVPASMYLFVYNVGTVVLRSRAVPRVSRTPTYDELTRRFNERHAPDSDSSVTERRASRRLRADRAAAPHKAHARRLYSHAQ-