Monarch geneset OGS2.0

DPOGS212173
TranscriptDPOGS212173-TA1179 bp
ProteinDPOGS212173-PA392 aa
Genomic positionDPSCF300038 + 1051049-1052319
RNAseq coverage458x (Rank: top 27%)
Annotation
HeliconiusHMEL0125620.088.64% 
BombyxBGIBMGA006620-TA0.083.04% 
DrosophilaCG8003-PA4e-11049.75% 
EBI UniRef50UniRef50_D7ELU44e-12656.96%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D7ELU4_TRICA
NCBI RefSeqXP_001648596.11e-12756.60%hypothetical protein AaeL_AAEL004183 [Aedes aegypti]
NCBI nr blastpgi|1571048422e-12656.60%hypothetical protein AaeL_AAEL004183 [Aedes aegypti]
NCBI nr blastxgi|2700159003e-12256.96%hypothetical protein TcasGA2_TC004088 [Tribolium castaneum]
Group
Gene OntologyGO:00082709.3e-10zinc ion binding
GO:00055152.1e-07protein binding
KEGG pathway 
InterPro domain[9-128] IPR0206831.3e-27Ankyrin repeat-containing domain
[326-363] IPR0028939.3e-10Zinc finger, MYND-type
[44-73] IPR0021102.1e-07Ankyrin repeat
Orthology groupMCL15044 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212173-TA
ATGGAAAATAAAAGCGAAGAACAAGCACAACCTGAAAAATCTATTTTCACTGCCATTGCCCAAGGCGACCTTCCCGAATTTAAGAATATCCTCGCACAACATAAGGGGAGCGTCGATTTCTTTGATGAAAATGGCATGACGGCGTTACAGCACGCGGCTTATAAGGGTAACAAGGATATGGTGCAACTCCTTCTCGATAGGGGTGCGGATGTGAATTCTGGCAAACACGAGTACAACTATACAGCTCTTCATTTCGGAGCTCTGTCTGGAAATTCTGATGTTTGTAAGTTACTTTTAGATGCAGGTGCTAAACCAACTGCCACTAACTCTGTAGGACGCAGTGCTTCTCAAATGGCAGCATTTGTTGGTAACCATCACACAGTGGCTACTATCAACAACTATGTACCGGCTAGCGAAATATCCTATTTTTCGGTAGTTCAAGGGCAACAAACAGAACCACATTTACCACCATTCTTGGTTGAATCTCTTCATAAACTTGTTCTCGGTGTAAACATCCACCCCGTCAGACTGGCATTAAACTTACAACACATGTCAGCTCTTTTAGAGAATGCAGAAAAAGTCAGCAAAGTGCTTGAGATGCTTTGCAAAAAGGAAATGACAAGAGGTAGTGATACAAATGAAGTTATGGCATTCAAATATCATTACCTCGCTTATATCCTAAGGGAGATCAATAATATACGAGACAAACAGAAACCTGTAGAAAGCAAGGAGGACAAGAAACATGATGTTATTGAAATTTTTTCTAAGAAGCTATTAAAACCCGGCAAGGATGGTGTGTCATTAGACCTCATGGATTCCTTCCTCAAGGACTGTGTCCGAGAGTTTCCCTACAGAGAATGTACAACATTCCACCAAATGGTAACATCTCTTACTAGTAAAGACCCACCTCCAGCTCTAACAGTGATCAACTGTACAATTAATGGTCAGAGAGGATTTGTTGATGCTATCCCCTACTGCAGCACTTGCGGGGAAGAAAAACCAGCCAAGAAGTGCTCAAAATGCAAAACAGTTCAATATTGCGACCGCGAATGCCAGCGTTTACATTGGTTTGTACACAAGAAGGCATGTAACAGAGAATCAAGTGTCCCCGTCAATTCAAAGCCCAACATCGACCCATCAGAAATTACCGCCGAGTTGCAAAATCTTGTTGCCGGATAA

Protein sequence:

>DPOGS212173-PA
MENKSEEQAQPEKSIFTAIAQGDLPEFKNILAQHKGSVDFFDENGMTALQHAAYKGNKDMVQLLLDRGADVNSGKHEYNYTALHFGALSGNSDVCKLLLDAGAKPTATNSVGRSASQMAAFVGNHHTVATINNYVPASEISYFSVVQGQQTEPHLPPFLVESLHKLVLGVNIHPVRLALNLQHMSALLENAEKVSKVLEMLCKKEMTRGSDTNEVMAFKYHYLAYILREINNIRDKQKPVESKEDKKHDVIEIFSKKLLKPGKDGVSLDLMDSFLKDCVREFPYRECTTFHQMVTSLTSKDPPPALTVINCTINGQRGFVDAIPYCSTCGEEKPAKKCSKCKTVQYCDRECQRLHWFVHKKACNRESSVPVNSKPNIDPSEITAELQNLVAG-