Monarch geneset OGS2.0

DPOGS210234
TranscriptDPOGS210234-TA1449 bp
ProteinDPOGS210234-PA482 aa
Genomic positionDPSCF300196 - 232118-235641
RNAseq coverage167x (Rank: top 51%)
Annotation
HeliconiusHMEL0204482e-10759.43% 
BombyxBGIBMGA002373-TA5e-2957.39% 
DrosophilaCG32343-PB3e-4434.52% 
EBI UniRef50UniRef50_E2AG152e-5537.91%GA-binding protein subunit beta-2 n=5 Tax=Formicidae RepID=E2AG15_CAMFO
NCBI RefSeqXP_316418.42e-5641.23%AGAP006384-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|3838597842e-5642.45%PREDICTED: neurogenic locus notch homolog protein 2-like [Megachile rotundata]
NCBI nr blastxgi|1582957842e-5336.48%AGAP006384-PA [Anopheles gambiae str. PEST]
Group
Gene OntologyGO:00055151.5e-08protein binding
KEGG pathway 
InterPro domain[43-195] IPR0206831.5e-41Ankyrin repeat-containing domain
[143-171] IPR0021101.5e-08Ankyrin repeat
Orthology groupMCL16123 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210234-TA
ATGACTTCAGATCTCTGTGAAGAAGATGTAGTCGTTCGATTATCCTCACCTCATATAATATCATCAGAGACGATAGTACCAGCTGCCAGTAATGTTGGCGGTGGACGAATACAGACCGGAGGAGTGGAGCTGGGTCGCAGACTGCTCTTAGCAGCCAGAGCAGGAGATACCGCTACTGTACTTGATCTCATGGCCAAAGGTGCACCATTTACCACTGACTGGCTGGGTACATCACCGCTGCACCTGGCTGCTGCCAACAACCATGTGGAGACATGCGGTGTATTACTGAGGGCGGGTGTGTCTCGGGATGCTCGGACTAAAGTTGAACGAACACCGCTGCACCTGGCCGCACATGCTGGGCATGCCGCTGTAGTTGCACTGCTGCTCGACCATGGAGCTATGGTGGACTGTCGCGACATGCTCCACATGACGCCGCTGCACTGGGCGAGTGCTCGAGGTCACGTGGCCGTGGTCCGCGAGCTAGTGTGTCGCGGCGCGGATTTGCTCGCTCGCTGCAAGTTCAGGAAGACGCCGCGCTGCCTCGCCGTCCGCGCCGGGGCCAGTGACGTCATGGCTGTCCTCGACCAAGCTGCCAAGGAACACGACCGACCCACAGTGACTGAGGAAACGCCAAAGATTCAACATTTTGAAACAATCCAAAGACTACAGGAGGTCAGACAGCAGACCAAAACCAAGCCTCCGGAGAAGACTATCGTAATAGAATCTAAGACTGAGCCGGCGTCGGGTCTGTCCGGGGCGGCGTTACTCCGCGCACACGGCATCACTCTCCTACCCCGGGACCGCGGCTCCACTGTACTCAGCGCACTGAGGAGCGGACGGACCGTCGTACTGTCCGATGCCGGGAAGCTGATGTTGAAGGAGAGCACCAACGCCCCGGTGATGGTCAGCGCCACCAGCGCCTCTGTGGACGCGAGCAACAACACAGCCAGCAACAGTCAGTCAAGCTTGCCCACAACTAACATAGTGACCAGTTCAAACATCACCGACGCTAAAGGGGTCATGGTCCGAGCGAGGACTCTCAACACCATCAAGGGCGTCAAAGGCTTGCAAATGCTCTCCGTCAACAGATCCGACCACACTGTTAAGAAGGTCATCAGTTCACATGACTTGCAGAAAGTTAAATTACTCGGCGTGAAAGAGAACAAGTCACCCCGCCGTCCAGCTCTCAAGATCCTTCTCAACAAAGCCAACCTCACACGACTACTAGCCAACACCACTAACGCTTCTACCACCAACAACACACAGATATCGATCGAGCCTTCCGGCGAGCTGAGCGAGTCGCCGGTTCAAAGTGACGCGGTGATGGAGGACGCGTCGGAATCGTCTCTGAGGGTTCAACTGCAACAAGCGCACGCCGCCCTGGCCAGCCTGGCCGCAGAGTTACGACACTGTAAGGCTAAACTGGCCAAATACGAACACACGCACTGA

Protein sequence:

>DPOGS210234-PA
MTSDLCEEDVVVRLSSPHIISSETIVPAASNVGGGRIQTGGVELGRRLLLAARAGDTATVLDLMAKGAPFTTDWLGTSPLHLAAANNHVETCGVLLRAGVSRDARTKVERTPLHLAAHAGHAAVVALLLDHGAMVDCRDMLHMTPLHWASARGHVAVVRELVCRGADLLARCKFRKTPRCLAVRAGASDVMAVLDQAAKEHDRPTVTEETPKIQHFETIQRLQEVRQQTKTKPPEKTIVIESKTEPASGLSGAALLRAHGITLLPRDRGSTVLSALRSGRTVVLSDAGKLMLKESTNAPVMVSATSASVDASNNTASNSQSSLPTTNIVTSSNITDAKGVMVRARTLNTIKGVKGLQMLSVNRSDHTVKKVISSHDLQKVKLLGVKENKSPRRPALKILLNKANLTRLLANTTNASTTNNTQISIEPSGELSESPVQSDAVMEDASESSLRVQLQQAHAALASLAAELRHCKAKLAKYEHTH-