Monarch geneset OGS2.0

DPOGS209338
TranscriptDPOGS209338-TA1029 bp
ProteinDPOGS209338-PA342 aa
Genomic positionDPSCF300336 - 187252-190599
RNAseq coverage135x (Rank: top 56%)
Annotation
HeliconiusHMEL0132513e-6840.93% 
BombyxBGIBMGA014451-TA3e-3433.98% 
DrosophilaCG14711-PA2e-2227.20% 
EBI UniRef50UniRef50_B3LW172e-2226.71%GF17452 n=1 Tax=Drosophila ananassae RepID=B3LW17_DROAN
NCBI RefSeqXP_002017088.13e-2429.04%GL22112 [Drosophila persimilis]
NCBI nr blastpgi|1951523276e-2329.04%GL22112 [Drosophila persimilis]
NCBI nr blastxgi|1947409801e-2826.46%GF17452 [Drosophila ananassae]
Group
Gene OntologyGO:00056345.7e-10nucleus
GO:00082705.7e-10zinc ion binding
GO:00036761.3e-06nucleic acid binding
GO:00056221.5e-05intracellular
KEGG pathway 
InterPro domain[5-60] IPR0129345.7e-10Zinc finger, AD-type
[220-258] IPR0130871.3e-06Zinc finger, C2H2-type/integrase, DNA-binding
Orthology groupMCL26199 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209338-TA
ATGATCGATTTATTTGATGAAGCGAAAGAGATTGTAAAGAAAATAGAATTCTGTACAGGAATTTCGTTAAAAGAAGATGTGGAACTTCCAAAAAATATCTGCAACACGTGCATCCGGAACCTGTCTGTGGCACATAAATTTAAAACAACATGTATTCTCTCCGAGAGAACTCTGCATAACTTGTTAGATGTGAAAGTAAATATCAATAGCAATATAAAAGACTTGGAAAACTGCAAAATAGAGCTTAAGTCAGAGGATTTATATGATGATGCATCAAACTGTGGATCGGAGAATGGAAACTGTGATTTAAAAAAAATAAAAACCGAATCAAATGAAAACAATTTACTGCCAGATGAAATTGAAAGAAACAGCGAGTCACGGAAAGCTATAAATACATCAAGGAAACGGGGTACATACAAGAAGGTCAGCCCAAGACGGCTCAAGAAGCTGAAATTCAGAAAGCTGGCCTGCGAACCCTGCGGACTAAAGTTCACTGATAAAAGACAATCCGATGATCACAAGTTACAGCACAAAGAGGAGCCCTGGATATGTGAGATCTGCGGCAAGTCATTCAGTCACAGGTCGTCGCTGCACTCGCACGTGTCGTCCCACACCCGCATGTTCTCGTGCGAGCATTGCGACTACAGCACGGGGAATAAGTTCGACCTCATCAAGCATATGCAGATACATCTAGGTCTAAAGCGTTTCAAGTGCACGGAATGCCCGGCCGTGTACCGCACGTCGTCCTCGAGGCGTGACCACGTCCGACGGACACATCTCCAGCTGAGACCTCACGCCTGTCACCTCTGCGACCGAACCTTCTACGATAGGACAAAACTTAACAGACACGTTGATTCACATTTTGATCTGAAACGGTTCGAGTGTGACATATGTCAGGCCTGTTTCTCCCGGAGGTGGTACTGGAAGAAACACCTGGAGAAGCAGCACAATGTAGTGGTACCACCCAAGCGACCCGGCAGGCAGAGGACCACGGAATATAGCTTTGATGTGACAGAGAATTTAAAGTAA

Protein sequence:

>DPOGS209338-PA
MIDLFDEAKEIVKKIEFCTGISLKEDVELPKNICNTCIRNLSVAHKFKTTCILSERTLHNLLDVKVNINSNIKDLENCKIELKSEDLYDDASNCGSENGNCDLKKIKTESNENNLLPDEIERNSESRKAINTSRKRGTYKKVSPRRLKKLKFRKLACEPCGLKFTDKRQSDDHKLQHKEEPWICEICGKSFSHRSSLHSHVSSHTRMFSCEHCDYSTGNKFDLIKHMQIHLGLKRFKCTECPAVYRTSSSRRDHVRRTHLQLRPHACHLCDRTFYDRTKLNRHVDSHFDLKRFECDICQACFSRRWYWKKHLEKQHNVVVPPKRPGRQRTTEYSFDVTENLK-