Monarch geneset OGS2.0

DPOGS202831
Transcript: DPOGS202831-TA; 1719 bp
Protein: DPOGS202831-PA; 572 aa
Genomic position: DPSCF300018 + 717330-719346
RNAseq coverage: 90x (Rank: top 63%)

Annotation
Heliconius: HMEL009280; 0.0; 58.09%
Bombyx: BGIBMGA010503-TA; 5e-156; 51.66%
Drosophila: CG9203-PA; 1e-68; 52.70%
EBI UniRef50: UniRef50_D6WIH1; 1e-90; 37.88%; Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WIH1_TRICA
NCBI RefSeq: XP_972573.1; 3e-91; 37.88%; PREDICTED: similar to Zinc finger RAD18 domain-containing protein C1orf124 [Tribolium castaneum]
NCBI nr blastp: gi|91080035; 5e-90; 37.88%; PREDICTED: similar to Zinc finger RAD18 domain-containing protein C1orf124 [Tribolium castaneum]
NCBI nr blastx: gi|91080035; 3e-97; 37.54%; PREDICTED: similar to Zinc finger RAD18 domain-containing protein C1orf124 [Tribolium castaneum]
Group:
Gene Ontology: GO:0003677; 2.5e-07; DNA binding
               GO:0006281; 2.5e-07; DNA repair
KEGG pathway:
InterPro domain: [15-184] IPR006640; 2e-52; Domain of unknown function SprT-like
                 [382-405] IPR006642; 2.5e-07; Zinc finger, Rad18-type putative
Orthology group: MCL17406; Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202831-TA
ATGAACTTAGCAGACCCGGAATTGGAGCTCATAGACCCGACACCCAATGTTCATATGCTGTTTATACAATTCGACAAAATATTCTTTTATACGAAACTTGCAAGCAGAGCTGTGGTGCGATGGAGTAAAAGAATGTATTCTTGTGCTGGGATATGTTCGTATGATAGGGGAGGACTCTGTGATATAGCTCTCAGTGAACCCCTGCTGAAGTTGAGGCCTCGCAAGGATCTAATTGAGACACTTCTGCATGAAATGATTCATGCATACCTGTTCGTCACATGTCAGGATCAGGACAGAGATGGTCACGGACCCAATTTTAAGTCACACATGTATAGAATCAACAGTGCAGCCGGCCTCAACATCAGTATCTATCATGATTTTCATGATGAGGTGAAACTTTACCAAACGCACTGGTGGAGATGTGATGGTCCATGTCAATCAAGGAAACCACACTTTGGTATTGTCCGTAGAACTAACAACAGAGCTCCAGGACCTTCAGATTACTGGTGGAACGATCATTTAAGAAAATGCGGAGGTACCTTCATTAAGATAAAAGAACCGGAAAATCAAGGTAGGAAAAAAGCAGTCCCTAAAGTTAATAAAGGAGATATCACAAAATACATCAACAATAAAGATAAAATAAAGGGTGTCCCAGGTGACAAATTGACCAAACCAATATTGAAAGATAATAATATGATAAAAAATAATTCAGTAACCAAGAGTAATGGCAGCGGGACAGTTGTTGTTACTAAAAAGAACAATGTGGTGATTAATCCAAAGCCAAACAAACAAATTGAACTCTTCAGCGGAACAGGACACACATTGAGCGGGAAGACCACAACACTCCTGGATATTGCCGAAACTGTTAGAAGTATTTGGGCAAATAAAGAGATACCCACCATGGCACCCGATAGAAAAGAAGTTAAAGGTATGGTAGGTAGAAAGAATGAGAATTCCGTATCTGGTAATAAGCATAAAAGTGATAGTTCCAATCAATATTCTCCCCCCATGAAAATTAAGAAAATTGATGATTACTTCAAAAAGAGCGCCAAAAATGTCTTAAAGGATATCTATGGACAAGACTTTGATATAAAAGAAGTTAATGCAAACAAGAGATTGTCTGTGGTGGCAGTTGAAAATGACTTAGTTGATTGTCCTGTTTGCAGTCAAAAAATTGCCAGTAATCAAGTTAACCAACATTTAGATGAGTGCCTTAATAAAGATATAATAGAAAAGATATCCAAAGATAGCATTCAGCCTGTGATATTTAGTGAGATAAGCAATAAAGATGTCGAAAACCCAAAAATAGAAGTCAGTAAAACAAATGACGTCAAAGAAACTTGTAGACAAATAAATAAAAAGTCAAAGATAAAAAATGAAACCGGTGTACAAATTAAAATGGAACCCGGGACTAGCAGGGACGTAGAAGTCGAAGCTGGAACGAGCAAATCACTTAATGAACAGAAATGTCCATGCTGCGAAAAAAACATAAACAACACTATGGATGAACATCTGGATGAATGTTTAGCTTTGTTCAGTGATCCGACAACAGCGCCCGCCGAGGGTGACACCAGTTTGATAGAAACCATTGAAATTGAAGATGATCTGGATGAATCTCTGACCTTCAACTCCACCGGCACCAAATGCCCTTGTCCCTGCTGTCTCCAAATGATTGAGCAAGCCGATATGAATTCCCATTTAGATTCATGTTTGAGTTGA

Protein sequence:

>DPOGS202831-PA
MNLADPELELIDPTPNVHMLFIQFDKIFFYTKLASRAVVRWSKRMYSCAGICSYDRGGLCDIALSEPLLKLRPRKDLIETLLHEMIHAYLFVTCQDQDRDGHGPNFKSHMYRINSAAGLNISIYHDFHDEVKLYQTHWWRCDGPCQSRKPHFGIVRRTNNRAPGPSDYWWNDHLRKCGGTFIKIKEPENQGRKKAVPKVNKGDITKYINNKDKIKGVPGDKLTKPILKDNNMIKNNSVTKSNGSGTVVVTKKNNVVINPKPNKQIELFSGTGHTLSGKTTTLLDIAETVRSIWANKEIPTMAPDRKEVKGMVGRKNENSVSGNKHKSDSSNQYSPPMKIKKIDDYFKKSAKNVLKDIYGQDFDIKEVNANKRLSVVAVENDLVDCPVCSQKIASNQVNQHLDECLNKDIIEKISKDSIQPVIFSEISNKDVENPKIEVSKTNDVKETCRQINKKSKIKNETGVQIKMEPGTSRDVEVEAGTSKSLNEQKCPCCEKNINNTMDEHLDECLALFSDPTTAPAEGDTSLIETIEIEDDLDESLTFNSTGTKCPCPCCLQMIEQADMNSHLDSCLS-
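
Consistency check (illustrative, not part of the original record): the sketch below shows how the lengths listed above relate to each other, and how the start and end of the transcript translate into the start and end of the protein. It assumes the standard genetic code, assumes the genomic coordinates are inclusive, and assumes the trailing "-" in the protein sequence marks the stop codon. The names `CODON_TABLE` and `translate` are hypothetical helpers introduced for this example.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# Values copied from the record above.
transcript_bp = 1719
protein_aa = 572
genomic_start, genomic_end = 717330, 719346

# 572 amino acids plus one stop codon account for the whole transcript.
codons = transcript_bp // 3
# Genomic span, assuming inclusive coordinates; the excess over the transcript
# length is sequence not present in the transcript.
genomic_span = genomic_end - genomic_start + 1
non_transcript_bp = genomic_span - transcript_bp

# First 30 and last 12 nucleotides of DPOGS202831-TA.
head = translate("ATGAACTTAGCAGACCCGGAATTGGAGCTC")
tail = translate("TGTTTGAGTTGA")

print(codons, genomic_span, non_transcript_bp, head, tail)
```

To check the full record, the complete nucleotide sequence can be passed to `translate` and the result compared with the protein sequence after replacing the final `*` with `-`.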