Monarch geneset OGS2.0

DPOGS214584
Transcript: DPOGS214584-TA (4416 bp)
Protein: DPOGS214584-PA (1471 aa)
Genomic position: DPSCF300050 - 475478-482538
RNAseq coverage: 7x (Rank: top 87%)
Annotation
Source           Hit                  E-value  Identity  Description
Heliconius       HMEL007662           0.0      63.40%
Bombyx           BGIBMGA005150-TA     0.0      64.39%
Drosophila       CG42534-PC           9e-71    50.17%
EBI UniRef50     UniRef50_D6WP95      2e-154   61.35%    Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WP95_TRICA
NCBI RefSeq      XP_001807607.1       2e-155   61.23%    PREDICTED: similar to CG6599 CG6599-PA, partial [Tribolium castaneum]
NCBI nr blastp   gi|189239180         3e-154   61.23%    PREDICTED: similar to CG6599 CG6599-PA, partial [Tribolium castaneum]
NCBI nr blastx   gi|270009789         0.0      36.59%    hypothetical protein TcasGA2_TC009087 [Tribolium castaneum]
Group
Gene Ontology    GO:0005515           1.4e-06            protein binding
KEGG pathway
InterPro domain  [52-486] IPR020683   4.5e-65            Ankyrin repeat-containing domain
                 [125-154] IPR002110  1.4e-06            Ankyrin repeat
Orthology group  MCL14640 Insect specific

Nucleotide sequence:

>DPOGS214584-TA
ATGGACAAGCGCGCAGTGAGCGGCCGGGGCGAGCCCGTCCCAAGCATTCTCAAGAAATCAAATAAACATCGTTCCGGTGACCTCGTTGATAAGAGCTCAAAAATAAAACGAGAGAATAAAAGGGGTGATGTGAATATCCCTCTACCTCAAGGTTGCACTCCACTCATGTACGCTTGCCAACAGGCAAACTACAAGTCTGTTATAGAAATCCTTAACAAAGACCCATCATCAGTGCGCATCAGAGACCGTGGTCTACGCTGTGCTCTGCACTACTGTGCTTCGAGCGGTGCTGGTGTGAGCCAGGCGGCTCGGGCGGCTTGTGCCGACAGGGTGCTCATGGCAGCCAGCCACTTGGCTGACGCCCGTGACGCCGACGGCCTGACACCCCTTCATCTGGCAGTGGTTCACGGAAACGTACCACTAGTGCAAACTTTGCTCGCTGCTGGTGCTGATGTCAACGCTAAAGACAATGAACAACATACAGTGGTCCACTGGGCAACAGTTTGTGGCGAGGTCGGCGCTCTGAGAGCTGTGTTAGTATCAGGAGCTGACGCCGCTACGCCTGACCAGCACGGCGGGTATCCTCTACACTACGCTGCTCAGATGTGTGGAGCACCAGCCGCTACTGATCACCAGGGCAGAGGAGCGGCGTTGGAGGTACTTCGTGCATTAGTAAGGGAAGGCGGCGCTCGGGTCGATGTCCGTGATGCTGATGGGAGGACACCTCTATTATGGGCCGCGTCAGCTGGCTCGGCCGCCGCAGTACTAACTTTGCATCAAGCAGGAGCGAGTGTAGATGATGCTGATAGAGATGGCCTGACAGCTTTACACTGTGCTGCGGCAAGAGGACATACTGAAGCACTTGAGACTCTGGTTGGACTGTGCGGTGCTAGGGTTGACGTCGCTGACTCACACGGCTGCACACCGTTACATTACGCAGCAGCTCTAGGTCATGCTGACGCAACCTCAGCCCTACTGGTTCACGGCGCGGACGCACATCGACAGGATCGTAGAGGGAGAAGTCCTGCTCATACCGCTGCAGCCAAAGGGCAAATAGAAACAGTTCGCATACTAGGCGCAAGGGGAACAAATTTATGGTTAAGAAATTCTAAGGGAGATCTTCCATTACACGAGGCCGTTGCTTCGGGTCGAAGGGAATTAGTGAAATGGCTTTTAGATGGTAGACCATCACAAGTCAATGCTACAAACCACGAAGGCCGAACTCCTTTGCATATAGCAGCAGCAACTGATAATGCTGACTTGTGTCGTCTACTTATGGATCGTGGAGCTGAGGTAAACCCTGTCGCTAAGTCATCCAAGAACGAACCCTTAACTCCCTTAGATTGTGCTATAAGTAGAGGTCATAGAGCTACTGCCAAATACCTACAAATGCACGGTGGTTTACCAGCTACTAAACTTTCAAATACTGAAATTATAATAGATGGTTCCTCTATCACAGCTTTACCAACAAGAAAAGTAACAAGTACAAAGATTGATGTTCGAGATAGAATTTTTATAGAAAAACGTGAAGTTGTTGAACTCTCAAGTCCGATCACTAATAGAAAGAAAGTAATAACTCGAAGCGATTCAGATTTCGATAAGAGTTCATCTTCAGATGAACGTAAAGTAAAAAGAAGAACTAGAAATAAATATGCGGATATTTATGCTAAGCGTAGAAAGAGGTTATCAGAAAAACAGAAAAGTTTTAGCGATGGGTGTGACAGTGAATTTGAAGAAACAAAGAAACGTGATCATTATAAACACATTATAGTAAAGCCAAAGGAAAGTAAAAGTAGATCGAAAAGTGAACCCTCTAGAAAAAGTAAAAGTAGTAAGAGTCGTGAACGTTATCACAGATCTACCTCAGAAAGTTCATCTGAAAGCTATTCCGCTAGAAGGAGAAGTAAAAGGCATCGAAAAAGAAATAAAAGGAAAACTACGTCATCATCAGAAACCACCGCATCAGATATTAGTAGACGAAGAAGTTCGAGGCATAGAGA
AAAAAAAACTTCTATTCATATTGAAAACGACGACAGGAAGCAAAGTGTTAACATTATCAAATACAAAAACATAGATGATTCTCGGGAAAAAGATAAAACAGTTGAAAGTGCTTCAGCAATATTGCCGTCACAAAGAACGAAAAGCGAAGAAGAATCAACAAAAAGTCAAGATATAACTAAAAAAAAAACTTATAGTGAAACAGAAACTGATACTGTGTCTGTTAAAACGAATATGGTAATAACAGAAGCCCAGATCCATATGGAACGTGAATCGAGTCAGCATGGCAGTGCAGAAATGACAGTTACTATGGATTCACTAAATAATGTTTCAATAGAGACAAGTAACTTATCTGTTACTCATAAAGATAAGAATGAAGACGAGGAAAAGAAATCCGAACCACAAGTCCCAACAGAGGAGGTAACTGAAGTAGATAAACCTATCGATGAGAGCATTGAATCAAAAGAAGAAAATAAAGTAGTTATTGAGAGTTCTGTAGAAGATGCTAAAAGTATGCATCGTACAACATCATTAGATGATTCAAAACAAAGTGAAACTAAAAATAACATATCTGATAAAGATACTCAAGAGACTTCAGAGGAACTAAAAACAAAGTCTCCATCTGTCGATAATGAAATTCCTGAAAAGGAGGCTTCCCTCACCGATAAAGAAATTAAAAGCATAGATAAATCTCAAGACGATAGTTTGACAAATAGTCAAAAACAATCTTTTCAAGTTTTGTCTGGTCCCAAGGATAGCAAAACGGAAGAAGGCGCTTTAAAATCATCGGGAGAAAATGAGTCTACAAAAAAAGATGATGGAGAAGCGCTCCCTAAAGTTTCTTCTGCAGTATCATTTTCAGAAAAAGATGAAATTATTAAAATCAACGATAACGAAACAATTGCAGATCAATCAGATAATACAGAAACTTCCAATAAGATTATAGACGATGAAATAACAAAAACACCCCACGAAATCACAACTGTCTCTGAAGAAAAACCAGAATTGCAGACTGAGGCTAGAATTGGGGCCGCTGAGTTACATGCGGCCGCAGATAAAGGGATAATTAAAATTTTACCTGATGATTCAACTATGGAGGACGTAGAAAATGAAAATTTAGAACCTGATAAGTTTGTTGAATATCAAGACTCCTCTGTGACACCTATTTCGCCCAGACGAATCTCAAAATCATTAAAAGATAGTCAAAGTAGTAGCAGAAAGAGTAGCATCTATGAAACTGAAAGTTACAAGGTTTTGTCAGAAACTGTCGAAGCTGATGAAGTTAGTACAGGTATTTTAAAGAAATCAAGTTTTATTGACGATGAAAATATAGAAGGTGATACACTTGATAACACATTGAGAAGAGATAATTCATTTAGTCGAATTCCGAGCGTCAGTGACAATGAGATTTATTACAGTCATTCAGAAGTTAATGGCAGAAGAAAACGATTTCGAAAAAAAGGAAAAATTAAAAGTCGATTATCTTTGAGATCAAAGAGTGAAAATTCTGAAAGAGATTACGAGTCCAGTGGCTTCATGGACTCTGGTTTTGAACCTAGTCCACGGTTAGTTCAAAGACGGATTATGAGTCCTCGACTACAGGCTTACTATCAACAACGAAGAAGTGCAAAATTAACGGGAAAAGTAGATAGCAAAATACCGGTCAGAAAACCTGGAGATAAAAAGGCTGTCGATATGAGATCCGTGACGCAGCGGTTGCAAACAAACATGAGAAGATACTACTGTGAAAGAAAAATATTTCAGCATTTATTAGAGTTAAAACGATTGCAAATAAGAACAAGCAAAGCAAACGAAGCTGTTCTAGTTAAAAGAACAATAGACGAATACAATAAGTCTAGTCTGGCCACTGTCGGCTTAGGTCCATATAATAGCACAGATTACTCATTTAGTACTTTTGAAAAATTTTTGTATGAGAGCTTGAGAAAACTACAGAAAAGTGGCAAAAAACATCTCGACAATTTGCCTGAAAGACCCATTGATT
TTGATTATGGTGAGACCGAATTGTATAAAATGACCGGCATACCTGATAACCCTTGCTTATGTACAAGCAAGACACACCGATGTTTCCACGCTGTACACGCGTACACCGGTATTCCGTGCTCTGCTTACATTCCTTACAAATGGAATCATCACACGATGCCGAAACCTGCTACTGCAGTGTCAAAGACGAGGTCTAAAGGATTTTTACCTAAAATCAATTCGAAGCCACCATCTGGTAAAGCTCACGTCACACTTGAAGTCTCCCACGGAACCGAAAGGCAGCTGATTGCATTACCCGCTGAAAAACTTGATAAAAACAAAAGATATTACGTAACTTTCACTGTCAAAGGTTCTGAACCACCCTCTGATAATGATAACTCCTCACCAAAAACATCTAAATCGCCAAAAAGTGGCTGA

Protein sequence:

>DPOGS214584-PA
MDKRAVSGRGEPVPSILKKSNKHRSGDLVDKSSKIKRENKRGDVNIPLPQGCTPLMYACQQANYKSVIEILNKDPSSVRIRDRGLRCALHYCASSGAGVSQAARAACADRVLMAASHLADARDADGLTPLHLAVVHGNVPLVQTLLAAGADVNAKDNEQHTVVHWATVCGEVGALRAVLVSGADAATPDQHGGYPLHYAAQMCGAPAATDHQGRGAALEVLRALVREGGARVDVRDADGRTPLLWAASAGSAAAVLTLHQAGASVDDADRDGLTALHCAAARGHTEALETLVGLCGARVDVADSHGCTPLHYAAALGHADATSALLVHGADAHRQDRRGRSPAHTAAAKGQIETVRILGARGTNLWLRNSKGDLPLHEAVASGRRELVKWLLDGRPSQVNATNHEGRTPLHIAAATDNADLCRLLMDRGAEVNPVAKSSKNEPLTPLDCAISRGHRATAKYLQMHGGLPATKLSNTEIIIDGSSITALPTRKVTSTKIDVRDRIFIEKREVVELSSPITNRKKVITRSDSDFDKSSSSDERKVKRRTRNKYADIYAKRRKRLSEKQKSFSDGCDSEFEETKKRDHYKHIIVKPKESKSRSKSEPSRKSKSSKSRERYHRSTSESSSESYSARRRSKRHRKRNKRKTTSSSETTASDISRRRSSRHREKKTSIHIENDDRKQSVNIIKYKNIDDSREKDKTVESASAILPSQRTKSEEESTKSQDITKKKTYSETETDTVSVKTNMVITEAQIHMERESSQHGSAEMTVTMDSLNNVSIETSNLSVTHKDKNEDEEKKSEPQVPTEEVTEVDKPIDESIESKEENKVVIESSVEDAKSMHRTTSLDDSKQSETKNNISDKDTQETSEELKTKSPSVDNEIPEKEASLTDKEIKSIDKSQDDSLTNSQKQSFQVLSGPKDSKTEEGALKSSGENESTKKDDGEALPKVSSAVSFSEKDEIIKINDNETIADQSDNTETSNKIIDDEITKTPHEITTVSEEKPELQTEARIGAAELHAAADKGIIKILPDDSTMEDVENENLEPDKFVEYQDSSVTPISPRRISKSLKDSQSSSRKSSIYETESYKVLSETVEADEVSTGILKKSSFIDDENIEGDTLDNTLRRDNSFSRIPSVSDNEIYYSHSEVNGRRKRFRKKGKIKSRLSLRSKSENSERDYESSGFMDSGFEPSPRLVQRRIMSPRLQAYYQQRRSAKLTGKVDSKIPVRKPGDKKAVDMRSVTQRLQTNMRRYYCERKIFQHLLELKRLQIRTSKANEAVLVKRTIDEYNKSSLATVGLGPYNSTDYSFSTFEKFLYESLRKLQKSGKKHLDNLPERPIDFDYGETELYKMTGIPDNPCLCTSKTHRCFHAVHAYTGIPCSAYIPYKWNHHTMPKPATAVSKTRSKGFLPKINSKPPSGKAHVTLEVSHGTERQLIALPAEKLDKNKRYYVTFTVKGSEPPSDNDNSSPKTSKSPKSG-
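
The transcript and protein lengths listed for this record can be cross-checked against each other: 4416 bp = 3 x (1471 aa + 1 stop codon). The following is a minimal Python sketch of that check, assuming the two sequences are saved as FASTA text and that the trailing "-" in the protein record marks the stop codon (an assumption, not stated on this page). The helper names `read_fasta` and `lengths_consistent` are hypothetical, and the toy record stands in for the real sequences.

```python
def read_fasta(text):
    """Parse FASTA text into {record_id: sequence}, joining wrapped lines."""
    records = {}
    current = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # Record ID is the first token after '>'
            current = line[1:].split()[0]
            records[current] = []
        elif current is not None:
            records[current].append(line)
    return {name: "".join(parts) for name, parts in records.items()}


def lengths_consistent(cds, protein, stop_marker="-"):
    """True if the CDS length equals 3 * (residues + 1 stop codon).

    Assumes a trailing stop_marker in the protein denotes the stop codon.
    """
    residues = protein.rstrip(stop_marker)
    return len(cds) == 3 * (len(residues) + 1)


# Toy record in the same layout as this page (not the real sequences).
toy = """>toy-TA
ATGGAC
AAGTGA
>toy-PA
MDK-
"""
seqs = read_fasta(toy)
print(lengths_consistent(seqs["toy-TA"], seqs["toy-PA"]))
```

Joining wrapped lines before measuring matters here because the nucleotide sequence above is broken across several lines; measuring any single line would understate the transcript length.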