Monarch geneset OGS2.0

DPOGS203244
TranscriptDPOGS203244-TA1110 bp
ProteinDPOGS203244-PA369 aa
Genomic positionDPSCF300210 + 1041-3817
RNAseq coverage102x (Rank: top 61%)
Annotation
HeliconiusHMEL0087074e-13767.32% 
BombyxBGIBMGA010714-TA2e-3628.12% 
DrosophilaCG15436-PA2e-2526.32% 
EBI UniRef50UniRef50_D6X4Z74e-2829.57%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6X4Z7_TRICA
NCBI RefSeqXP_001814647.18e-2929.57%PREDICTED: similar to zinc finger protein 617 [Tribolium castaneum]
NCBI nr blastpgi|1892417832e-2729.57%PREDICTED: similar to zinc finger protein 617 [Tribolium castaneum]
NCBI nr blastxgi|1892417833e-3129.14%PREDICTED: similar to zinc finger protein 617 [Tribolium castaneum]
Group
Gene OntologyGO:00056347.1e-15nucleus
GO:00082707.1e-15zinc ion binding
GO:00036761.9e-08nucleic acid binding
KEGG pathway 
InterPro domain[40-110] IPR0129347.1e-15Zinc finger, AD-type
[274-305] IPR0130871.9e-08Zinc finger, C2H2-type/integrase, DNA-binding
Orthology groupMCL34487 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203244-TA
ATGACTTCTCTAGTTTTACTTAAATCAATTTTATTGGCGAGTGTTTTGCAAAAAAACAAAAAGAAAAAAAAAATTAAATCAAAGAAATTGTTTTCTACAAACAAAATAGATTTAAAGATATGTCGTATATGCAATGAAGAAAAAGGAGAAATACCGATATTTAATGATTGTATACAGCCTAACATACCAGAAGAAATAAAACATTTTTCAGGAGTTTCAATATTTAAAACCGATAATTTACCAAAGCAAATATGTCAAAACTGTTTGGATTTACTAAATGGCTGTATAATGTTTCGAGAAATGTGCCAAAAGACGAACCAGCTTATGAAAAGAATTTCCTCGGTCCCAGAAATTGCACCTTGTATCAGAAAGGAGGAGGATGTGAAGAAAGATTTTACAATTGACCAAAATGGAAGTGATGAATCATTGAACATACCTTCACCGACATGTTCAGAAGACAATTCAGAAATATGGAACTGTACTTCTTGTAAAACAGAATATTACAATCTGGAATCATACAACAAACATTTAGTTGAATGTAAAACACAAATCAACGATTTGAAACCTGAATTAGATCGGAAAAGAACAAAAGAGACATTCCTCTGTGATATTTGTGGAAAAGTTGCCCGTTCCAATGCAAATCTTCTCATTCATATGGCCATACATGAGAATATATTTCCTTTCAAATGCGATCAATGTCCTTACCAAGGGAGGACAATGGATCTGCTCAGAGTACACAGACGAACACACATGGCTGATAAGCCATTCAAATGTGCACACTGCCCTAAATCTACGACCACGTCCAGTAACCTAATGAAGCACATACGACATGTTCACAGCAAAACTAGGCCTTATAAGTGCACATATTGCGATAAAGCATTCTCTTACCAACATGATATGAAGAGACACATAAAAGACATTCATCTAAGACAGGGTACCGTTGAATGCGATATATGTTTCAAAAAATTTAATACAAAAAAAATATTACAAGGACATCGTTTTAAGATTCACAAAATAAAAGGCGAAAGACACGGACGTTTGCCTTCTTATTTGCAATGTCAACAACCCAACGAAAATAAACAAAATGAATGCGAGTCTGTTGTCTATTAA

Protein sequence:

>DPOGS203244-PA
MTSLVLLKSILLASVLQKNKKKKKIKSKKLFSTNKIDLKICRICNEEKGEIPIFNDCIQPNIPEEIKHFSGVSIFKTDNLPKQICQNCLDLLNGCIMFREMCQKTNQLMKRISSVPEIAPCIRKEEDVKKDFTIDQNGSDESLNIPSPTCSEDNSEIWNCTSCKTEYYNLESYNKHLVECKTQINDLKPELDRKRTKETFLCDICGKVARSNANLLIHMAIHENIFPFKCDQCPYQGRTMDLLRVHRRTHMADKPFKCAHCPKSTTTSSNLMKHIRHVHSKTRPYKCTYCDKAFSYQHDMKRHIKDIHLRQGTVECDICFKKFNTKKILQGHRFKIHKIKGERHGRLPSYLQCQQPNENKQNECESVVY-