Monarch geneset OGS2.0

DPOGS211463
Transcript: DPOGS211463-TA, 1491 bp
Protein: DPOGS211463-PA, 496 aa
Genomic position: DPSCF300223 + 245298-246889
RNAseq coverage: 4x (Rank: top 89%)
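The genomic interval and the transcript length listed above can be compared directly. The following is a minimal sketch, assuming the genomic coordinates are 1-based and inclusive (the record does not state its convention); the function and variable names are illustrative only.

```python
def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive interval (assumed convention)."""
    return end - start + 1

# Values copied from the record above.
genomic_span = span_length(245298, 246889)
transcript_len = 1491

# Bases inside the genomic interval that are not part of the transcript.
extra = genomic_span - transcript_len
print(genomic_span, extra)
```

Under that assumption the interval is slightly longer than the transcript, which would be consistent with a small amount of intronic sequence, although the record does not list exon boundaries.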
Annotation
Heliconius: HMEL015339, 0.0, 85.51%
Bombyx: BGIBMGA002168-TA, 0.0, 82.83%
Drosophila: CG11966-PA, 2e-81, 42.34%
EBI UniRef50: UniRef50_D6WDR9, 3e-101, 51.06%, Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WDR9_TRICA
NCBI RefSeq: XP_971330.1, 5e-102, 51.06%, PREDICTED: similar to AGAP002993-PA [Tribolium castaneum]
NCBI nr blastp: gi|91079382, 9e-101, 51.06%, PREDICTED: similar to AGAP002993-PA [Tribolium castaneum]
NCBI nr blastx: gi|157110805, 2e-105, 50.21%, hypothetical protein AaeL_AAEL000826 [Aedes aegypti]
Group
Gene Ontology: GO:0003676, 6.9e-14, nucleic acid binding
KEGG pathway:
InterPro domain: [460-491] IPR013087, 6.9e-14, Zinc finger, C2H2-type/integrase, DNA-binding
Orthology group: MCL15730, Insect specific
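The InterPro entry above gives the domain as a residue range. The following is a minimal sketch of pulling such a range out of a protein string, assuming the bracketed coordinates are 1-based and inclusive positions on DPOGS211463-PA (the record does not say so explicitly). The helper name `extract_region` is hypothetical, and the demonstration uses a toy string rather than the real sequence.

```python
def extract_region(seq: str, start: int, end: int) -> str:
    """Return residues start..end of seq, using 1-based inclusive coordinates."""
    if not 1 <= start <= end <= len(seq):
        raise ValueError("region lies outside the sequence")
    # Python slices are 0-based and end-exclusive, hence start - 1.
    return seq[start - 1:end]

# Toy example: residues 3 to 6 of a ten-letter string.
print(extract_region("ABCDEFGHIJ", 3, 6))

# A [460-491] range would cover this many residues under the same convention.
domain_length = 491 - 460 + 1
print(domain_length)
```

Applied to the protein sequence given further down, the same call with `460` and `491` would return the annotated zinc-finger region.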
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211463-TA
ATGCCTAGTGAAGCGAAGCGACGCGCGCCGGCAAAACAACGCGGCAGGGTGGCGGCTTCCAAGTATGGCGGTGGGGTGTCTCTCTCGGGTGTTGCGGTGAGGCGCGGCATGAAGTGCGCCGGTGTGGCCGGCGGCGGCGACGAGCTGCTGCTGGGCGGCTGCTGGCTGGACCCCAAGGACGCCTCCCCTCCGCCCTGCCACGAGCTCCTCGACTCTCTCCTCGCCCCTCCACCACTGGCCGAACTCAAACCTCTCCCGCCCTTCACCGGATATACCGGCCACCTCTCCATCAACGGCATCTCCGGTCATCATTTCCATGCTATAGCACAACGTCTTCCAGAAGAAAACAACAACTATCCCACGCAGAGCGGTTACGGTGCCGATCAGGATGTGGTCTCCTCGTCTACGTGTGCCGCTGAATCCGAGCCTCCGGATCTCGAGGATGTCAAACCTTTCCCTCTGGACGCCCCGGATACCAAACCGTTTGCAGAGTCGTCGGCACCCGGTGTAGCGGACTCCTGTGCTCAATCTTGTTCCCCGCCTGCGAAAATATTTGATGATGGACATAAACAAGACATGTACGACATCAGTTCAATAGAGGACTTAGCAGCCATCATTGGCTCCGCGATCGCTGATACGACTGTTCAGTCTCAGCCGGAGGATCCTGACAGAAATGATTCTAGAGATAGCTGGATGGACATAGATGCATGGATAGCGGGCGCTTGTAGTCAACATGGTGACAAAATAGTTCAACAAGACCTATCGGAATTCGGTTTCGGTTCTCCGCCACCATCACAATCGAGTCACCAGTCCAGTATAGACTTTCCATGCAAGCCCAGCCAAGAGTTCTCGAAGAGAGAAGAATTTCTTCAAAAGAATTTAGGACAGGCTGGATCTACTCTCCAGTCGCTATTATCTCATGGCTACATGCCTTTATTGCAAAACAGATTGCAAAATGGTCCCCCTGTTAAACAAGAGGCTCCAAGTTCGACGAGTTGTGGCATGGACGTAATATCAACGTCGTCGCCACCAGGGAACGCCGTTTCAACCACAGACGGTGTTGGAGGATTGCTCAATGGCAGATATGCACCCCATTATGCTCTCGGGCTAAAACTGGATGGACTCTGTAGTCCAGAAAGATTACTCGGCTATTCACACGCCCACACGACTACGGTTACTCCATCCAGTAAAAGTAAACGTAGTCGAGCAAAAGCAAAACAAGTAAACAACGGAGGAAGTCTGCATGGAAGTTCGTTAGCCTTCGCATCGGCAACATCTGCGGAACTGAGCGGCTTGCTAGGGAAAGAGAAACCGGTGCATCGCTGTGGGATCTGTAATAGAGGGTTCTTGAACAAGTCCAACATAAAGGTTCATCTCCGGACTCACACCGGGGAGAAGCCCTTCAGATGCGACGTATGCGCAAAGGCTTTCCGTCAGAAGGCGCATCTGATAAAACATCAGCAGATCCACAAGAGGATTGGGCGGGACTAG

Protein sequence:

>DPOGS211463-PA
MPSEAKRRAPAKQRGRVAASKYGGGVSLSGVAVRRGMKCAGVAGGGDELLLGGCWLDPKDASPPPCHELLDSLLAPPPLAELKPLPPFTGYTGHLSINGISGHHFHAIAQRLPEENNNYPTQSGYGADQDVVSSSTCAAESEPPDLEDVKPFPLDAPDTKPFAESSAPGVADSCAQSCSPPAKIFDDGHKQDMYDISSIEDLAAIIGSAIADTTVQSQPEDPDRNDSRDSWMDIDAWIAGACSQHGDKIVQQDLSEFGFGSPPPSQSSHQSSIDFPCKPSQEFSKREEFLQKNLGQAGSTLQSLLSHGYMPLLQNRLQNGPPVKQEAPSSTSCGMDVISTSSPPGNAVSTTDGVGGLLNGRYAPHYALGLKLDGLCSPERLLGYSHAHTTTVTPSSKSKRSRAKAKQVNNGGSLHGSSLAFASATSAELSGLLGKEKPVHRCGICNRGFLNKSNIKVHLRTHTGEKPFRCDVCAKAFRQKAHLIKHQQIHKRIGRD-
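The protein record can be checked against the transcript by translating it. The following is a minimal sketch, assuming the standard genetic code and a coding sequence that starts in frame at the first base; the function name `translate` and the `stop_symbol` parameter are illustrative. Only the first 30 bases of DPOGS211463-TA are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

def translate(nt: str, stop_symbol: str = "*") -> str:
    """Translate an in-frame nucleotide string; trailing partial codons are ignored."""
    nt = nt.upper()
    protein = []
    for i in range(0, len(nt) - len(nt) % 3, 3):
        aa = CODON_TABLE[nt[i:i + 3]]
        protein.append(stop_symbol if aa == "*" else aa)
    return "".join(protein)

# First 30 bases of the transcript above.
prefix = "ATGCCTAGTGAAGCGAAGCGACGCGCGCCG"
print(translate(prefix))
```

The protein record above writes the stop codon as `-`, so passing `stop_symbol="-"` when translating the full 1491 bp sequence should give a string in the same style: 496 residues followed by the stop symbol.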