Monarch geneset OGS2.0

DPOGS207752
TranscriptDPOGS207752-TA2385 bp
ProteinDPOGS207752-PA794 aa
Genomic positionDPSCF300042 - 569470-581239
RNAseq coverage582x (Rank: top 22%)
Annotation
HeliconiusHMEL0119680.069.60% 
BombyxBGIBMGA005306-TA0.069.49% 
DrosophilaCG17494-PA7e-3734.50% 
EBI UniRef50UniRef50_D6WXZ71e-5747.08%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WXZ7_TRICA
NCBI RefSeqXP_968078.14e-5847.08%PREDICTED: similar to SLMAP protein [Tribolium castaneum]
NCBI nr blastpgi|2700124745e-5747.08%hypothetical protein TcasGA2_TC006629 [Tribolium castaneum]
NCBI nr blastxgi|3838623514e-5827.68%PREDICTED: uncharacterized protein LOC100881837 [Megachile rotundata]
Group
Gene OntologyGO:00055151.7e-21protein binding
KEGG pathway 
InterPro domain[33-157] IPR0089841.7e-21SMAD/FHA domain
[55-145] IPR0002532.1e-20Forkhead-associated (FHA) domain
Orthology groupMCL25125 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207752-TA
ATGGTGATAGCAAAGATAAAGAGCGCTAAGGGGGCACAAATGAGTAAAACAATAAAAAATATGGAGTATAACTGGATGAGAGGAGCAATAAATCCAGATCCCCCTAATAATCAAGAAGCGGGAGTTCCTAAGGCAGTCTTCATACCTCACTCCAATTCTCTGCCATTTGAGGAACGACATGTCACATTGGAGCATCCAGTGAAGATCGGCAGAAATGTGTACCGTGCGACACCATCGCCAACGAATACCATATTCGAATGCAGGGTCCTCTCCAGGCACCATGCGACGCTGTACTACGACAAGGGACATTTTTATCTATTGGACAATGAGAGCAGCAACGGGACGTTCGTGAACAATAACCGTTGCACGCTCACTAACACGGAGCCGCACGAGGTGTTCTCTGGGGACGTGGTGCAGTTCGGGATACCTGTGGTCGAGAACACAGCCAACTCTGAGAAGAACCCGTTCCCACCGGTGATAGCACTGTTGAAACTGTACCACCCCGACGGTAGTGAAGCCCAGCTGTCCTTCAACCCGTCGATGACGACACCGCTCCATCTGAAGGAACTGTACCAGCTGAACAACTTCGTTCAGGAGGCGATACAACGCGAAGCTTCCTTAGAGACACGCCTGCGGACGTTAAGGGAATGCGTTGAAAGGACGAGATCCGAAGCGGCGGCCTCGTGGGAGCTGTTCGTGGGGGAGCAGCGGCTCCTGATGAGGGTCCACACCCTGGAAGCAATGCTGGCAGCTGGGAAACATCCTGACGAAGTACAGGTCGCCCAGCTGCTCAGCGATAAGAGGAACTACCAGGAAGTGGCACAGGAGAGTCTCCGTATGGCGCACGAACAACGTCTGTCTCTAGAGGAATCGCTGGAGCGTCGCAGTAGGGAGGCGGCGGCGCTGCATCATCAGAACTACGCCTTACATTTGGCCGCCACCAACGCTATGCAAGAATTGCAAAAATTGGCTGCGCGCTGTGAGCGTAAGATGTGCGCGGCGAGGCTCGCGGTAACCGCGGCCGAGGAGAGGGAACAGGCGCTGAGAAAACATCTTCCACTAGCGTATTCGGTACAAAACGGCGAGGCGAAAATGTTAGTAGTGAGTACAGACAGCAACAAACTGCAAGAAGCCAAACGAACCGGCTCCTTGGAGGAAGCGGTCCGCGCCCTGCCCGACTACATCAAACTGTTGTTGCCACAGAATCTGCTTAATAAGGTTGGCATCAAAGCTGTGTCGGCCGAAGGGAAGTCAGCCAATGAGTTTGAATTAATATTGAAGAAGAACAACTTGCATAATTCCGAAGGCAGAGAAAAACAAGAGAAGGAGGAAGGCGGTGTCGGCGGCAGCGGGGAGAAGTCCTCCGAGGCTGATCTGTGTACAGACGACGAGAAACATTCGAGACTCAACCACGATGGTGGGAATGAGGAATCTTTGAGTCCCTTGGACGGTGAACTACGTCCCGATAATAATGCCAATTCGACCTACAACGACGCTGATGTCAAAGAAGACTCGCCGAGCAAAGCCGGCGTGGAGTACGCGAACATCCTGCGCGTGATGGCCGGCCTGAATGAGGAGATAAAAGTGTTGCGGGAGAGGGTGAGCGTCGCCGCGGCAGAGAACGAGGCTCTCAGGACGGTGAGGGACGAGCTGCTGGCCGCCAGGGACGAGCCGCGAGAAGAGGACAGCGAACACGACGCGGCCGCCTACAAGGCCAGGATCTCTGAATTACAGGAGCAACTGACTCGGTCTACGGCTACAGAGGAGGCGAATCTCGCTAAGATCCAGCGACTGACGACGAATGCGGCCGAGATGCAAGCCGAACTCGCCTTCAGACCCACCAGAGACGACATCGACGACCTCACCACCATGGTCGGGAAGCTGAGGGCGGAGGTGCTGGAGAGAGACGACGTCATCCAGCGACTGCAGGCGCAGCTCAGGGAGAGGGCGGACACCGTGGACCGCGCCGTGGAGACGTCGAGGTCATTAGAGAACATCAAAGAAGCCGCCGACGACATCGACACGAAACAGATATCGGCCGACATAGACGAGATGTTCCGCCTGGACTACGACGACGACGAGAGCGACTCCGCCGCCACCGAGATCAACAGGGACGAGGAGCTGAGCGCCGGCGAGTACGTGAAGCTGGACGACGACGAGCGGCTCAGGGTCACCATGAGGAACGGGTCACTGCACGCTTTGGAGGAGGAGCTGGTGCGGGCCAAGGAGCGCTGGGCGGAGGTCTGCGCCGAGAGGGCGAGGCTGGCGGCCCAGCTCGCCTCCGCACAGAAGCCACTGAGGTTCGACATCGGCCACGCGCTGGCCCTGGCGCTGCCTCTGATGCTGGCCTGCCTGTACTATATGCTGCTGCCTTACCTGTCCTGA

Protein sequence:

>DPOGS207752-PA
MVIAKIKSAKGAQMSKTIKNMEYNWMRGAINPDPPNNQEAGVPKAVFIPHSNSLPFEERHVTLEHPVKIGRNVYRATPSPTNTIFECRVLSRHHATLYYDKGHFYLLDNESSNGTFVNNNRCTLTNTEPHEVFSGDVVQFGIPVVENTANSEKNPFPPVIALLKLYHPDGSEAQLSFNPSMTTPLHLKELYQLNNFVQEAIQREASLETRLRTLRECVERTRSEAAASWELFVGEQRLLMRVHTLEAMLAAGKHPDEVQVAQLLSDKRNYQEVAQESLRMAHEQRLSLEESLERRSREAAALHHQNYALHLAATNAMQELQKLAARCERKMCAARLAVTAAEEREQALRKHLPLAYSVQNGEAKMLVVSTDSNKLQEAKRTGSLEEAVRALPDYIKLLLPQNLLNKVGIKAVSAEGKSANEFELILKKNNLHNSEGREKQEKEEGGVGGSGEKSSEADLCTDDEKHSRLNHDGGNEESLSPLDGELRPDNNANSTYNDADVKEDSPSKAGVEYANILRVMAGLNEEIKVLRERVSVAAAENEALRTVRDELLAARDEPREEDSEHDAAAYKARISELQEQLTRSTATEEANLAKIQRLTTNAAEMQAELAFRPTRDDIDDLTTMVGKLRAEVLERDDVIQRLQAQLRERADTVDRAVETSRSLENIKEAADDIDTKQISADIDEMFRLDYDDDESDSAATEINRDEELSAGEYVKLDDDERLRVTMRNGSLHALEEELVRAKERWAEVCAERARLAAQLASAQKPLRFDIGHALALALPLMLACLYYMLLPYLS-