Monarch geneset OGS2.0

DPOGS208689
TranscriptDPOGS208689-TA4017 bp
ProteinDPOGS208689-PA1338 aa
Genomic positionDPSCF300043 - 524264-528280
RNAseq coverage130x (Rank: top 56%)
Annotation
Heliconius% 
BombyxBGIBMGA007799-TA9e-4653.12% 
DrosophilaFancd2-PA2e-6822.95% 
EBI UniRef50UniRef50_F5HPU90.045.45%Fanconi anemia, complementation group D2 n=2 Tax=Obtectomera RepID=F5HPU9_BOMMO
NCBI RefSeqXP_001810863.13e-11125.80%PREDICTED: similar to Fanconi anemia D2 protein [Tribolium castaneum]
NCBI nr blastpgi|3505367790.045.45%Fanconi anemia, complementation group D2 [Bombyx mori]
NCBI nr blastxgi|3505367790.045.47%Fanconi anemia, complementation group D2 [Bombyx mori]
Group
KEGG pathway 
Orthology groupMCL12864 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208689-TA
ATGTCTCCAAAACGGAGAAAAACTATTCACGAAGATTATTTTGAAACTACTTTAAAAGAAAGTGGAATTGATCTAGCCAAACCCCCTGAAAGATGTGTAGCAAAATATGACATAATAGTTATTACGCGAAATTTAAAGAAAATTTTACAAAAGCACTCAGATTATCCCCAAAATTTATCTGAATTCTTCGATAATTTTGTTGAAAGATGTCAAGACTTAGAAATGTTCAAGCATTATTTATTTCCTAATATTGTCAGAAAAACGACAGAAGATCAGTCTATTCAGTGCAAAAACGACAGTATTGTTAGAATTCTTCTCACTATACCATTGTTACAGAATAAATTAATCAACTATATATTCGAAAAAGCTATTGACCTGGCCGCGGATTCAAAATGTGGGCCCTGGATCAAAATGATTTTAAGATCTTTATGCACTTTAGATAATATGATAGATAGTGACAATATAGCTACTAATATAATAAGTTTATTAGATGTTACTTATGAAGAGTTAGTGCAACTTGAAATAATAACTACTATTCCAGATATAATAGGTGACCAAGCACATGACAAAATTGTTATAGAATTAAGTAAAATATTGAAGCAAAAGGATCACAAGCTTATACCCGCTACACTTGACTGCCTTTCTTATTTGTGTCTGTCTAATGATCAATATGAAGAATTGAGGAATGAAACTTTGAACATTCTGAAGACAACAGCAAACTGCAGCTACTTTCCAAATTTTGTCAAATTTCTCCTAATTCCTGGAAAGTCATCTGAGAGTACACACATGGTGGCCGTAAAAGGGTTAAGAAATGCCTTGAGCTGGCCATCATCCATTGCATTACCTGAGGATATTGCATCCAGCCAAATATTAACAGCTCAGGCCATACGTAATACTATGGTATCCTCTGAATCCATAGCAAATGCTTGGATTAAATTAATTTCAAACTGTAATGTTCATTCAGATCACGAAGCATTCAATTTTATTATTATATTAATTCTATTCTCATTATCCGAAGAAAAACAAAAACAGGTGGAGAAGACAATGCGTAAACAAATAAAACTTAATATATTTAAGGAGGATTTATTGGATAAGGCCTTTGAAAAATATAAGCCAATAATCAAAGAATATCTAAAACACATGATATTACTAACAAACTCACTTTTAAAAACACCAGATTCTATGGTTCAATCTTTTGCATCACATATGTATACTCTGATGTTTGACCATCTAGAAGATTCTTGTCAGACAATAGTTGTAGAATTGTTGCAATTTGGATTGAATTGCAAGGATAGTCTTATCAATATATTGGCAATTTTAAACAATGTTGCAGCTAAGAATATGTCTGTATTAAAACAACAAAGTTCACAAATGTTAACACTTTTAGATAGAAAGGATGACATGACCTTGAATGAAATAAGGGCGGTCATGAATTTAGTATGCGGTCTAGCCTACAGCTATGATAACTCAGTAATACGAAGTGATGTTCATATAATAATAAGAAAGTACTTAGGAAGGTCCAACCACACTATTAAATATCATGGAATACTTGCCGGTATTCATGCTGTAAAATATTTAATAGCATTTACTTCTGATGAAGACAGTGATATAAGTTTACCGGAAGATATAAATTATGGCTCCGTGGATTGTCTCCCTGAAGGCAATCTTAGAGAGGCAGCACAGATCATAGAACTTATAAACTGCAGTACCAGGGAGTTTCCTAAAATGATAGCTTTTTTCTATGATGAATTCTGTGAAATAATCAAATCTTCATCCCACATTAACAAACATTTTCTTAAATGGATAACATTGGTTGTGACTAATGATTTGGCACAAAATTATATTGTAAATAATCTACCCCATGAGTCAGTGGGAGAGCTAACTCTGTGCCTGCAGTACTGTCTCAATGCGGAAAGTGAAAAAGATGATGAAATAGCCATTAATATTGCGGGCTTAACATTGGAAGAACAGGAAGATGTAAATATACTAATACTGTCACCTTTGTTCCAATTGGTTCAGACTTTGGATAACTTGGAAGAGAAGGATAATAATTCAACAAACATTTATGCACTAATCGGTTGTCCTGTTGTTATGCCAAAAGTAGACGTAGAAGTCGTGAGGGATGAACTTACTGATAGTTCTATATCAGCTATTCTGGATTGTCTTATTCACTGCGTCAATTGGTTTCGCGAAGTTTTAAACGCATTTTCTGCTGTCCCTGAGCAAAACTTACGAAGTAAGGTTATCAATAGGGTTTTCCATATACAGCAGTTGGAAAGCTTAATTACAGAGATTCTGACAAAGAGTAACCTGACATATCAGCCGCCATCTTGGGCGGGTCATCTAAACACGAGTAATGAGAAAGAGAAACTGGAAAGAACCTTAAAGAAACTGTCAGTTGCTAAACGCAAGAAAAAGAAAGAAGGGGTAACCGACGAATCGATTTTACCAGAAAGTTGTAAGTCGCAAGCTACGCAAAAAAAGACAGCCGGTAATTCCAAAATGGCTTTAACTCACAACATACAATTTAGAGCCCTTGACATAAAGGTCATTGAATTATTAAACGAAGAGTTAACAGAGACTGATTTCGAACAAGCTTTGACAGTCAAAATTGCCACATTTCTTTTAAGTCATATCAATAAAGCCCTTGAAAAGACGTTACACCCGAAGTTGAAAAAAAATATTTTCTCTAACAAACAAGATACCACTGATATTTATGACCCGGTTAAAGCAGAGCAATTTGCTGAATATGTCAACAAAATTATGCCAAAAATTGTTGAACATCTGACATTTGTTACGTCCTGTTTGGAAGCCAGGATGTGCTTCAATGATACCGATCAGGAGAGAGATGAAGACGATGAGCTTATGTATAACGACGAATTATTTGAATACATAAGTCTATTAGAAAACATATTTAATTTTCTAAAAATTTATTTCAAATGGATTGGTTTCAAAAATCGAAACAATCCGCTCTTGCAATCCTCTCTGAAAACATTAGCAAAGTTAGATGATGAAACATCCGTTACGATGCAGGACCTATTGACCAATATTGCGAAAAGTTTACAGAATTACAAAAAGTACTGTGTCTTTCTAAGTACAGCTACATCATTAATTGAACTGTTGAAAACATTACAGGAACATTCTTGCAATCGTTCTATATTGGTTATTTTAAGAGATACAGCAAAATCATTCTTATCAAAACCATGGAAGACTGCAGAGGGCGCAGACGAAAAAGGAGTTCAATTAAATCAAAGTATAGACATATTTGGTAAAGTTTTCTTCGAGAACATTGAAATTGATGACATTAAAGATTGCACTTTGTCAATAATGAATGACGTAGAGGCTCTTAAAAAAGGGCGTTCTCATCTCAATTCTTATAAAAGCATTAATAAAAACAATTTCTCGATATTATTCCGGATCATAGGAAGTTCTTTGCATGATAGAACTAAGCAGAAAGTAAACGAAAATCTAACTAACTCAGAACATTTGGAAGTGTGGGAAAGTGTGTTAGTTACATTGAAGTCTATGGTAGAAATAACAAAAATATTGGAATTCAGAAATATCATGGTGGTATTTTTCAAAAAATCTATCCCTGTTATAAAACTATTTGTGACATACGGAATCCCAATACTGCAAATTGAGTTCAAAAACAATCCGCAAAGAATATTAGGTATATGGAGCGTGTTACAGAAATCAACGAGGTTTCTACAATCCGTATGTTGTCATCTCAAATTGAAGAATGACAAAGTGTTGATGGCTAAAATTCCAACAGTCAAAGAACTGCTGGAGACGCTGATATATAAAGTTAAATCAGTTTTAGCGTCGAATGAGTGTACGGAAGCTTTCGAGATGGGAAACCTTAAAAATAGGAACATCCAAGGGGAAATCATTGCGTCCCAAGAAACTGTGGATGATGTAGAGGTGCAAGACGATTGTGACGACCACTTGCCAGATGACAGCGACTCCGAAGATAATGATCTAGATTTAGGCTTGAAGAGTGCAAGCGAAATGATATAA

Protein sequence:

>DPOGS208689-PA
MSPKRRKTIHEDYFETTLKESGIDLAKPPERCVAKYDIIVITRNLKKILQKHSDYPQNLSEFFDNFVERCQDLEMFKHYLFPNIVRKTTEDQSIQCKNDSIVRILLTIPLLQNKLINYIFEKAIDLAADSKCGPWIKMILRSLCTLDNMIDSDNIATNIISLLDVTYEELVQLEIITTIPDIIGDQAHDKIVIELSKILKQKDHKLIPATLDCLSYLCLSNDQYEELRNETLNILKTTANCSYFPNFVKFLLIPGKSSESTHMVAVKGLRNALSWPSSIALPEDIASSQILTAQAIRNTMVSSESIANAWIKLISNCNVHSDHEAFNFIIILILFSLSEEKQKQVEKTMRKQIKLNIFKEDLLDKAFEKYKPIIKEYLKHMILLTNSLLKTPDSMVQSFASHMYTLMFDHLEDSCQTIVVELLQFGLNCKDSLINILAILNNVAAKNMSVLKQQSSQMLTLLDRKDDMTLNEIRAVMNLVCGLAYSYDNSVIRSDVHIIIRKYLGRSNHTIKYHGILAGIHAVKYLIAFTSDEDSDISLPEDINYGSVDCLPEGNLREAAQIIELINCSTREFPKMIAFFYDEFCEIIKSSSHINKHFLKWITLVVTNDLAQNYIVNNLPHESVGELTLCLQYCLNAESEKDDEIAINIAGLTLEEQEDVNILILSPLFQLVQTLDNLEEKDNNSTNIYALIGCPVVMPKVDVEVVRDELTDSSISAILDCLIHCVNWFREVLNAFSAVPEQNLRSKVINRVFHIQQLESLITEILTKSNLTYQPPSWAGHLNTSNEKEKLERTLKKLSVAKRKKKKEGVTDESILPESCKSQATQKKTAGNSKMALTHNIQFRALDIKVIELLNEELTETDFEQALTVKIATFLLSHINKALEKTLHPKLKKNIFSNKQDTTDIYDPVKAEQFAEYVNKIMPKIVEHLTFVTSCLEARMCFNDTDQERDEDDELMYNDELFEYISLLENIFNFLKIYFKWIGFKNRNNPLLQSSLKTLAKLDDETSVTMQDLLTNIAKSLQNYKKYCVFLSTATSLIELLKTLQEHSCNRSILVILRDTAKSFLSKPWKTAEGADEKGVQLNQSIDIFGKVFFENIEIDDIKDCTLSIMNDVEALKKGRSHLNSYKSINKNNFSILFRIIGSSLHDRTKQKVNENLTNSEHLEVWESVLVTLKSMVEITKILEFRNIMVVFFKKSIPVIKLFVTYGIPILQIEFKNNPQRILGIWSVLQKSTRFLQSVCCHLKLKNDKVLMAKIPTVKELLETLIYKVKSVLASNECTEAFEMGNLKNRNIQGEIIASQETVDDVEVQDDCDDHLPDDSDSEDNDLDLGLKSASEMI-