Monarch geneset OGS2.0

DPOGS206570
Transcript: DPOGS206570-TA (1230 bp)
Protein: DPOGS206570-PA (409 aa)
Genomic position: DPSCF300108 - 318530-321406
RNAseq coverage: 87x (Rank: top 63%)
Annotation
Source            Hit                 E-value   Identity   Description
Heliconius        HMEL015388          4e-144    60.14%
Bombyx            BGIBMGA012176-TA    4e-23     32.12%
Drosophila        CG8319-PA           1e-22     27.50%
EBI UniRef50      UniRef50_D6WRY2     6e-21     26.72%     Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WRY2_TRICA
NCBI RefSeq       XP_002096976.1      4e-23     27.02%     GE25969 [Drosophila yakuba]
NCBI nr blastp    gi|195499503        8e-22     27.02%     GE25969 [Drosophila yakuba]
NCBI nr blastx    gi|195572214        3e-29     27.27%     GD18633 [Drosophila simulans]
Group
Gene Ontology:
    GO:0003676    4.4e-11    nucleic acid binding
    GO:0005634    4.7e-10    nucleus
    GO:0008270    4.7e-10    zinc ion binding
KEGG pathway:
InterPro domain:
    [292-334]    IPR013087    4.4e-11    Zinc finger, C2H2-type/integrase, DNA-binding
    [11-88]      IPR012934    4.7e-10    Zinc finger, AD-type
Orthology group: MCL34659 (Lepidoptera specific)

Nucleotide sequence:

>DPOGS206570-TA
ATGGAGACCTACGATAAAATTTACAAATCAATATGTCGTTTGTGTTTAAACTATAGCGTCAGTGAAAAAATGGTGCCTTTAATTGATAGTAAGAATGAAGACGGTCTCAGCGTATACGGAAAAGCGGTCCTTAGGTTCGCTAAAATATCCATTGACAAGCAAGATTGTTTACCGATTGCTATGTGTATAAAATGCTTGCAATTACTTAAACAAGCGATATTTTTTAAATTCATGTGTGAATCGAATGACTCCTGCCTAAATAAATTAATATCCACTGACGATTATAAACAAAAGATAGTTGAGTACACAATGTTAAGGTTTTATTTCCCAAATGAAAATTTAAATACTAAAAAGCGTGTGCGTAAAAAAGAGCAACAACTAAAAAAACCATCGGAAAAGACGAGAAATGAAGTGACTCTTACTTGTGGAAAAAACAATAATGATTTGAAATCTGTACAAGACCGCCTCCATAGTGTGCAAATTGAAAGTGATTGTGAAGAAAATGTGTTAGAAAAAATGGAAAATCTAATAGACTCTAATGTGTGTGATACTATAGAGTTACGGAAACCATTAAAGAGGAAATTGAGAAGGAAAAAATTGGAAAACAAGAGACGTAGAATGCTTCTTATACAACAGAGATTATCGCAGAAAAATACACAAGCTGGAAATTTAGTTTGTGGTATATGTAATAAAGTATTAGCGAACCAACACACATACGACCATCATATGCAACGTCACAACGGATGTAGGTATATTTGCGAGCACTGCGGCAAAGGATTCCCAGTGAAGACTGAACTCCAAATTCACCAAGTTTCTAAACACGGGACCGGTCCATATTTACAATGTTCGCATTGTCCTTTCAAAGCCCCCAGGAAATTCGATTTAATAGAACACGAAAGAATCCATTCAGGCGAACGGCCCTATACTTGCGAGAAGTGCGGATTAACATTCCGCAGGCGTGGTATATGGAAGAAACACCTAATATATCACACGGAAAAGAAAATACAATGTCCACGGTGTCCCAAAAAGTTTTTCCAGCGCAGCGAAATGTTGGCCCATGCTAATAACGTACACGATAGAGTTTACGTATATTTATGCAGCAAGTGTGGTGCGACGTACGCAAAAACTGCTACAGTTAGGCGACATATGACCGAAAGGCATGGTATTCCGCGTGAGATGCAAGGTAAAGTCGTGAGAATTAATAAGGCTGCTGGTCTTCAGGAACAATAA

Protein sequence:

>DPOGS206570-PA
METYDKIYKSICRLCLNYSVSEKMVPLIDSKNEDGLSVYGKAVLRFAKISIDKQDCLPIAMCIKCLQLLKQAIFFKFMCESNDSCLNKLISTDDYKQKIVEYTMLRFYFPNENLNTKKRVRKKEQQLKKPSEKTRNEVTLTCGKNNNDLKSVQDRLHSVQIESDCEENVLEKMENLIDSNVCDTIELRKPLKRKLRRKKLENKRRRMLLIQQRLSQKNTQAGNLVCGICNKVLANQHTYDHHMQRHNGCRYICEHCGKGFPVKTELQIHQVSKHGTGPYLQCSHCPFKAPRKFDLIEHERIHSGERPYTCEKCGLTFRRRGIWKKHLIYHTEKKIQCPRCPKKFFQRSEMLAHANNVHDRVYVYLCSKCGATYAKTATVRRHMTERHGIPREMQGKVVRINKAAGLQEQ-
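
The transcript and protein records above can be cross-checked against each other. The following is a minimal sketch, assuming the transcript is a complete coding sequence read in frame from its first base, that the standard genetic code applies, and that the trailing "-" in the protein record marks the stop codon. The helper names translate and lengths_consistent are hypothetical and not part of any database tooling.

```python
from itertools import product

# Standard genetic code, built from the conventional TCAG ordering
# (third base varies fastest). '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 1; trailing partial codons are ignored."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


def lengths_consistent(transcript_bp: int, protein_aa: int) -> bool:
    """True if the transcript length equals the protein codons plus one stop codon."""
    return transcript_bp == 3 * (protein_aa + 1)


if __name__ == "__main__":
    # First 30 and last 12 nucleotides of DPOGS206570-TA, copied from the record.
    print(translate("ATGGAGACCTACGATAAAATTTACAAATCA"))
    print(translate("CAGGAACAATAA"))
    print(lengths_consistent(1230, 409))
```

Under these assumptions the first ten codons should reproduce the start of DPOGS206570-PA, and the stated lengths (1230 bp, 409 aa) should agree once the stop codon is counted. The same translate helper can be applied to the full nucleotide sequence to compare against the whole protein record.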