Monarch geneset OGS2.0

DPOGS200402
TranscriptDPOGS200402-TA1407 bp
ProteinDPOGS200402-PA468 aa
Genomic positionDPSCF300236 - 577236-580520
RNAseq coverage6x (Rank: top 87%)
Annotation
HeliconiusHMEL0147352e-3368.89% 
BombyxBGIBMGA008979-TA6e-7864.25% 
DrosophilaProsap-PA5e-1031.14% 
EBI UniRef50UniRef50_C3NIT93e-1329.07%Ankyrin n=7 Tax=root RepID=C3NIT9_SULIN
NCBI RefSeqXP_001122606.11e-1732.97%PREDICTED: similar to Ank2 CG7462-PC, isoform C [Apis mellifera]
NCBI nr blastpgi|1234046948e-1534.22%ankyrin repeat protein [Trichomonas vaginalis G3]
NCBI nr blastxgi|1234046944e-1534.22%ankyrin repeat protein [Trichomonas vaginalis G3]
Group
Gene OntologyGO:00065209e-14cellular amino acid metabolic process
GO:00055154e-05protein binding
KEGG pathway 
InterPro domain[268-444] IPR0206832.6e-31Ankyrin repeat-containing domain
[270-444] IPR0060349e-14Asparaginase/glutaminase
Orthology groupMCL25560 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200402-TA
ATGGAAAGTGATCCCGACCCTATACTAAGTAGAACATCGGAAAATATCTCTCTACAAAGCAATATTGCATCTGGAAACAGTTCTGTTACTACTGAAGATGTCGAAAACACGAAAAGCAATACTAAAGATGACATCAGCTTACAATGTACATCAACCCAAAACCACACAAACACAAACGTGGACAGAGAACCAGTTTTTGTATTAGTGGACTCAAGATGTGTAAGCAGTCCCAGCTCCAGGGAAATCCTTATTAATAAAATAAACGTTGCACAAGATTCGACTAACAAGAGTCCGAAAAATAAGGTCATACATGTCGATACTCCTTGCACAAGCACAGCATATGCGGAATATATAACGATCCAAAGTCGGACATCCGATTCCATTGATGAATTGGAAATGATTGATATAAATTTACAAAGCGACAATGAAATTCCCATCGATACAAAACCAGATTCAATTGTTGTGGCATATTTACGAGATAAAAGCGAAGTCATAGATAAAGATGAACCAAGTTCGTCTAAAAAAATGTCGTTGCTCGATAACCTATCCAAAATGGAAACAATACCGGAAACTGAGCGGCTTTCTACATTTTCAATTCCATCAAATTCAAGACATGAAAGTATATCAGAAACCTCGTTGGATGTACACATACCTTCATATCCTGGGTCACCTCGATCTATTGACTTCAACTCAAGTAGTTCAATAGAATCCATAACCGTGCGTAACCCACAATTAAGAGACGCTATTGAGTTTCTGCATCAGGACAAGGAATTTCTTATAGCTGCTGAAACGGGAAATGATAAGCTCCTTGCAAAACAGGGAACTGACATACATCAATTCGATCACATCGGAAGAAGTGCTTTACACTTGGCAGTCTGCTCTGATAACACGAATGCTGTCAAAATGCTGCTGGAAGCTGGTCTTAATCCAAATATTAAAGATAATTTAGGGATGACTCCACTTTCGCTATCTTTGATGAGGAGGCCATCCACTGTTGTAGCTAATCTTCTTTTCGACCACGGAGCAGTGTTGATGCCACGAACAGACCCGATGGACACCGGCTTATTTATTCAATTTGTAATGATGTGCACACCCACATCTGAAGAAGAGAATATTCTACGACTACTTGTAGATAAAGGGGCTGTAATAAATGATACTGACGCTCCGGGACAGCGACAAGCCCTCCATTTTGCGGCTATGAGCAATAACGTGAACTTAATCCGCATTTTAGTGGGCCTGGGTGCGGATTTGTATTTGAAAAATCATAGAGGAGAAACGCCAAAAGACGTAGCCGAAATATTTCATTGTAGGGAAGCATTTGACTTACTGAACTATCTTGAAGAAATCGAAGAAGTAGCAATAACCGCTAATTCGAATACTTTACTGTTTGAAATCCCTGAAAAATAA

Protein sequence:

>DPOGS200402-PA
MESDPDPILSRTSENISLQSNIASGNSSVTTEDVENTKSNTKDDISLQCTSTQNHTNTNVDREPVFVLVDSRCVSSPSSREILINKINVAQDSTNKSPKNKVIHVDTPCTSTAYAEYITIQSRTSDSIDELEMIDINLQSDNEIPIDTKPDSIVVAYLRDKSEVIDKDEPSSSKKMSLLDNLSKMETIPETERLSTFSIPSNSRHESISETSLDVHIPSYPGSPRSIDFNSSSSIESITVRNPQLRDAIEFLHQDKEFLIAAETGNDKLLAKQGTDIHQFDHIGRSALHLAVCSDNTNAVKMLLEAGLNPNIKDNLGMTPLSLSLMRRPSTVVANLLFDHGAVLMPRTDPMDTGLFIQFVMMCTPTSEEENILRLLVDKGAVINDTDAPGQRQALHFAAMSNNVNLIRILVGLGADLYLKNHRGETPKDVAEIFHCREAFDLLNYLEEIEEVAITANSNTLLFEIPEK-