Monarch geneset OGS2.0

DPOGS202179
TranscriptDPOGS202179-TA1257 bp
ProteinDPOGS202179-PA418 aa
Genomic positionDPSCF300162 + 292706-295906
RNAseq coverage162x (Rank: top 52%)
Annotation
HeliconiusHMEL0108934e-5547.22% 
BombyxBGIBMGA003320-TA5e-5435.04% 
Drosophilaham-PB5e-0730.61% 
EBI UniRef50UniRef50_F4W4816e-1024.22%Serendipity locus protein H-1 n=7 Tax=Formicidae RepID=F4W481_ACREC
NCBI RefSeqXP_001843775.12e-0625.77%zinc finger protein 449 [Culex quinquefasciatus]
NCBI nr blastpgi|3320316052e-0924.22%Serendipity locus protein H-1 [Acromyrmex echinatior]
NCBI nr blastxgi|3320316058e-1724.29%Serendipity locus protein H-1 [Acromyrmex echinatior]
Group
Gene OntologyGO:00056347e-08nucleus
GO:00082707e-08zinc ion binding
KEGG pathway 
InterPro domain[20-100] IPR0129347e-08Zinc finger, AD-type
Orthology groupMCL20684 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202179-TA
ATGACTGATCAAGAATGTAATGATACGACAAAGGAAGATATTTACCCCGTCGCTGAATGCCGCGCGTGCCTGCAAGTCTTATATTCCGATTCCAATTTACTGAATATATTTGAACCATGGACGCCGCCGTGGGATGGGATGGAAAATACTATAGCCGAAGACTTAGTGAAGCTTACTAATATTCAAATTTCAGAAACTGATAAGCATTCGAAATTCATATGCGAGACCTGCTACCAGGTTCTGCTTGGGGCATGCCATTTTACAGCCTGCGTTAGGAAAAGTTATCAAATATTACTCGAACGCTATCCTTGTGAGTCTCAAATAGATGTAAACAATAAAGTGTGGCCTAAACCAATACAAGTAGACAAGACTGTACACAGCTCAATGTACCAAAACCCTGTGGACGTAGAAATCAAACAGGAAGCTATTTCCGATGAAGAGTACAGTAATGCTATGGAGACATACGATGAAGGAAAGGATAATCTAGCAAATTTAGATATTAAGATTGAACCGGAGGAAATACAGATACAGAATATTCAAATAAAAGTTAACGGTACCATAATAGAAGGGCAACTTAATTGTGACTCGTCATCAGAACACGTAACTAATGGAAATAGTACAATGGAAGATAATGAGTTAAGGGATATTGTAAAGGAGGAGCCGCTAACAGATGATGATGAAAATGACAATCTAGCCTCGGATCTACCGCTGGAATGTCTCCTGTGTTCTAAAGCATTCAACAGTGTTACTGGATTGAAGGCCCATGTGATTGCACAACATTCCTATAAATCCGTGAAACGGAAGTCGAACAGTGTGTCGCCACAGAAGAAAAAATGTAATTATATCTGTGCTATATGTAGAAGACGTTTTTCAACATCCACGGATCTAATGGTGCACGAAACTTGTCACAATAAGAGCGTGTGTTATGGCTGCAATCAGAGCTTCGACACGTTCGCACAATTGACTGTACACAGAAGGACGTGTAAGGCCGTCGCCAGTCGGATGAGACATAAGACTCTGGATGACGTTTTAAGGCCTCAAACCCAAGTTAAACAGCCAAAAAAGATAAGGAAAAAGGAATTGCACTGCACAGAGTGTAATGAAACATTTAGTGACCTTTATTACAAGAGGATTCACGAAGAAGTTCAGCATAGTTTGACAAGTGACGATGTTAGTAATAAGGTTGAATCCATGGAGGTTGATGTACCTGGTAGGATATTAACAAGAAATAAACGAAAAGAAGGTTACAAGCAATAG

Protein sequence:

>DPOGS202179-PA
MTDQECNDTTKEDIYPVAECRACLQVLYSDSNLLNIFEPWTPPWDGMENTIAEDLVKLTNIQISETDKHSKFICETCYQVLLGACHFTACVRKSYQILLERYPCESQIDVNNKVWPKPIQVDKTVHSSMYQNPVDVEIKQEAISDEEYSNAMETYDEGKDNLANLDIKIEPEEIQIQNIQIKVNGTIIEGQLNCDSSSEHVTNGNSTMEDNELRDIVKEEPLTDDDENDNLASDLPLECLLCSKAFNSVTGLKAHVIAQHSYKSVKRKSNSVSPQKKKCNYICAICRRRFSTSTDLMVHETCHNKSVCYGCNQSFDTFAQLTVHRRTCKAVASRMRHKTLDDVLRPQTQVKQPKKIRKKELHCTECNETFSDLYYKRIHEEVQHSLTSDDVSNKVESMEVDVPGRILTRNKRKEGYKQ-