Monarch geneset OGS2.0

DPOGS205508
TranscriptDPOGS205508-TA2937 bp
ProteinDPOGS205508-PA978 aa
Genomic positionDPSCF300056 - 253722-262481
RNAseq coverage428x (Rank: top 29%)
Annotation
HeliconiusHMEL0112950.073.12% 
BombyxBGIBMGA000140-TA0.064.05% 
DrosophilaCG7839-PA2e-9831.08% 
EBI UniRef50UniRef50_D6WGR94e-18044.50%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WGR9_TRICA
NCBI RefSeqXP_968241.10.044.52%PREDICTED: similar to CCAAT/enhancer binding protein zeta [Tribolium castaneum]
NCBI nr blastpgi|2700031741e-17944.50%hypothetical protein TcasGA2_TC002139 [Tribolium castaneum]
NCBI nr blastxgi|2700031740.042.63%hypothetical protein TcasGA2_TC002139 [Tribolium castaneum]
Group
Gene OntologyGO:00054881.2e-11binding
KEGG pathway 
InterPro domain[389-650] IPR0056125e-35CCAAT-binding factor
[179-506] IPR0160241.2e-11Armadillo-type fold
Orthology groupMCL12154 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205508-TA
ATGAAGCACAAAAGATTCGAAGAGAACATTTTAGTAGACGAAGCGTATACGAATTACGGTAAAGTTGATAAAAATGCTGATGAAAAAAAAGGTTATGGTATAGCAAAACATTTCGCTGATACTCTAGAATACGCGGACAAAAAGAAATGGTATCAGCAGCTACCAGAAGAGCCTTCGACACAAGGATCGATAACGCAAGAAAAAATTGAAGAACTCCGTAAAGAAGCGAGTAGTGCTCTACATGGTGACACATTAGCTTATGAAACAAAATCAAGTAAAAGTGGTTCATCCGACCAACAGTGGGCTCGTACACTATTAACCAAAGGTACAATTGGAGACAGAGTGGCCGCAGCAACAATACTAATACAGGACAATCCGCTGTATAATTTGACGGCATTGAGGAATTTAATCAATAATGTGAAGCCAGCTAAGAAAAAGGATGGAATAGTTATAATAGATGCTGTATCGGAACTGTTAGTATCAGAGCTGCTCATCCCTGACACCAAGCTGCGCACCTTCCAGCAGCATCCTCTACATGTCATTGATGAAATCACATCGGGCAACAAACAGGCAAGACGGAATGTTCTAAAACTATGGTACTATGAGGATCAACTTAAAGAGTTGTACGGAACTTATGTGGAAGCATTAAACAAGTTTGCTCATGACACAGTGGACGCTAATAAGGAAAAGTCTATCAGTGCCATGTCTTACCTCTTGATGCATCATCCGGAGAGAGAAAAGATGCTCCTAACAAATATAATAAATAAACTCGGTGATCCGAGCCAGACGGTGGCATCGAAAGTGATCTACCATCTCTGTCAACTTCTATACAACCATCCGAATATGAAATCTGTTGTTTTAGCTGAGATAGAGAAAATGTTGTTCAGATCTAACATATCCCCACGCGCTCAATACTATGGAGTGTGTTTCCTGAACCAGTTCTTCCTTGGTAAGGATGACAGTCGGATAGCTGAGAACTTGATCAGAATATACTTCTCATTCTTCAAGGCTTCTATCAAGAAGGGTGAAATAGATTCTCGTCTCATGTCAGCTATCCTGACGGGTGTGAAGCGAGCCTATCCCTTTGCTGACAGGGAGCGGTTGGTTGAGGCCTCCCAGCATGTAGACGCTGTACACCGACTGGTCCACCTGGCCAACATCAACGTGGCGATCCATGCACTGGCCTTGCTGTATCACATCAGTGATGCTAACAAAGGGACATCCGACAGATACTACACAGCCTTGTACCGGAAACTGACAAATTCCAATATATTCAATACTACCCACTCTGCATTGTTCTTCTCTCTCATATACAAGTCGTTGAAGCAGGACAAGGATATAGACCGGGTGACGTCATTCATCAAGAGATTATTACAGCTGTCCTGCTACATGAGCCCTGGCCAGGCTTGCGGAATGCTCTTCCTCATCTCGCAAGTATTGAAGAGTGATGATAAGAGAGAGGCTGTAAAACTGGTCTTCAGTGAGATTAAAGAGGAAATTAAAGAAGAAAATGAAACTAAAAATAATGATGAAAATCCAGAAGAATTAATGCATTCAGAAGTTGAATTAGATGAGAGTAAGGAAGATGCTGAGGAAAATGTCAAACAGAAAAAAATTGATCTCTTAATAGGAGATAAGAAAGATTTATTAATGGATGATGAAGAAGAGACATATGTTGACCTCAAAATAGACGATGAAGGTAACATAAAGCCTAAGAAGAGGAATACGAACTCTGTGACTGGGTGGTTTCATGCTAGAGTTGACAAGAAAGATGTACAAGAAAAAAACGTTGAGAAACAGTTGAAGAAAGCTATTAATATTGGAAAGACGATAACCAGTTATAGTCCACTGTGCCGTGACCCTCGTTTCACCGGAGCACACCTGACGGCGATGGCTGAACTGACAATGCTGATGAAACATCATCATCCGAGTGTCAAGATGTTTGCTGAAAAATTACTGAATAATCAAATAATCCAATATGGCGGCGATCCTTTGAAGGACTTTTCCGGTATCCGTTTCCTGGATAGATTCGTGTTCAAGAATCCAAAGAAACGTGCCGAGGTCACTGATGGGGAGGTCAAAAAGGTTAAGGGGTCACATCCGAAGTTCGCTGTTAGAAAGAACTATACAGCTAAAGGCATCAGAAGTATCGCTGTCAATTCATCGGCATATTTGAATGAGGATGTCAAGAAAATTCCTGTCGATGAAAGATTCCTATATGATTTCCTTCAAAAGCGCCGAGCGGCTGCTGATAGTGATGAGGAGAGTGACAACGACTCGGTGACCAGCGAAGATTTTGAGACCTATTTGGATTCAGTCACTGGAACCAAAGCACAGGAATCCGATGAGGAGTTAGATTATTTGGGTGAATTGGAGTCGAGTAAACAGAAACGACCGAAGGAAGTTGATGATGAGAAAGATGAGGTGATGAGCGATGATCAAGATGAAGACGATGATAGCGATGGCGAACTCAATATATCCGGTGATGAAGACGAGCCAGTACTATCCGGAGACGAGGACGAACTAATGTTAGAAGACAGCGAAGAAGAAGACCAGATAGATATACCAGGAAAGAAGTCCAAAAAGGATGCTATTAAATTAAAAGGTCACGAAAATCTTGGGTCACTGTTTGCATCGGCCGAAGAGTTCTCGACGCTTCTAGAAGAGACGGCAGCGAATAAAAAACAAGGTTCAAGCCAAGCGGTATCAAACACAGACAATTCAAGCACAAAACAACTGGCTTGGGAGGAAAAACGCGATAGGTGGATCAAAGGATACAATAAGAAGATATTGGGACATAAGAGCAAGGGCAAAAAATTCAATAGCAAAAATAACAAAAATGTCAAAGGCACAAAAATGGCTGATAAAAATATTGGCGGGAAACGAAAAGGCGGAAAAACTGACGGCGCCGGCGGAAAGAAGAAGAAAACAAAATAA

Protein sequence:

>DPOGS205508-PA
MKHKRFEENILVDEAYTNYGKVDKNADEKKGYGIAKHFADTLEYADKKKWYQQLPEEPSTQGSITQEKIEELRKEASSALHGDTLAYETKSSKSGSSDQQWARTLLTKGTIGDRVAAATILIQDNPLYNLTALRNLINNVKPAKKKDGIVIIDAVSELLVSELLIPDTKLRTFQQHPLHVIDEITSGNKQARRNVLKLWYYEDQLKELYGTYVEALNKFAHDTVDANKEKSISAMSYLLMHHPEREKMLLTNIINKLGDPSQTVASKVIYHLCQLLYNHPNMKSVVLAEIEKMLFRSNISPRAQYYGVCFLNQFFLGKDDSRIAENLIRIYFSFFKASIKKGEIDSRLMSAILTGVKRAYPFADRERLVEASQHVDAVHRLVHLANINVAIHALALLYHISDANKGTSDRYYTALYRKLTNSNIFNTTHSALFFSLIYKSLKQDKDIDRVTSFIKRLLQLSCYMSPGQACGMLFLISQVLKSDDKREAVKLVFSEIKEEIKEENETKNNDENPEELMHSEVELDESKEDAEENVKQKKIDLLIGDKKDLLMDDEEETYVDLKIDDEGNIKPKKRNTNSVTGWFHARVDKKDVQEKNVEKQLKKAINIGKTITSYSPLCRDPRFTGAHLTAMAELTMLMKHHHPSVKMFAEKLLNNQIIQYGGDPLKDFSGIRFLDRFVFKNPKKRAEVTDGEVKKVKGSHPKFAVRKNYTAKGIRSIAVNSSAYLNEDVKKIPVDERFLYDFLQKRRAAADSDEESDNDSVTSEDFETYLDSVTGTKAQESDEELDYLGELESSKQKRPKEVDDEKDEVMSDDQDEDDDSDGELNISGDEDEPVLSGDEDELMLEDSEEEDQIDIPGKKSKKDAIKLKGHENLGSLFASAEEFSTLLEETAANKKQGSSQAVSNTDNSSTKQLAWEEKRDRWIKGYNKKILGHKSKGKKFNSKNNKNVKGTKMADKNIGGKRKGGKTDGAGGKKKKTK-