Monarch geneset OGS2.0

DPOGS205243
TranscriptDPOGS205243-TA1398 bp
ProteinDPOGS205243-PA465 aa
Genomic positionDPSCF300265 + 325977-328512
RNAseq coverage217x (Rank: top 45%)
Annotation
HeliconiusHMEL0221390.073.06% 
BombyxBGIBMGA008740-TA9e-15759.43% 
DrosophilaCG12299-PA5e-2730.04% 
EBI UniRef50UniRef50_UPI0002026F8E2e-2929.61%UPI0002026F8E related cluster n=1 Tax=unknown RepID=UPI0002026F8E
NCBI RefSeqNP_500033.11e-2934.07%hypothetical protein Y55F3AM.14 [Caenorhabditis elegans]
NCBI nr blastpgi|3016264052e-2939.56%PREDICTED: zinc finger protein 585A-like [Xenopus (Silurana) tropicalis]
NCBI nr blastxgi|2914018136e-3727.30%PREDICTED: PR domain containing 5 [Oryctolagus cuniculus]
Group
Gene OntologyGO:00036766.4e-11nucleic acid binding
GO:00082705.1e-05zinc ion binding
GO:00056225.1e-05intracellular
KEGG pathway 
InterPro domain[395-419] IPR0130876.4e-11Zinc finger, C2H2-type/integrase, DNA-binding
Orthology groupMCL25534 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205243-TA
ATGCCAATGTTGTCAGATAAAAAACTGCCGTTGGTCAAATCTGAGCCGGGAGACACTGATGACAATGATTCCGACAACTACTTCCTCGCCAGCACCATTAAAAATGAATCGCCAGCACCCATCCAGAAGATTGTGAGAAAAGTTAACATAAAGAGTAAGAAGAAGGAGCGTATAGAGAAGAAGTTTAAGAATCTGGTCAACAAATTGGCTATAAATAGCTGTGAGAAAAAAAAAATGAAAAAAGACATTAAAGTTATGAAAGGCAGGAAGTTGACTATCAAGAAGGATATAGAGATTAACGTGACATTTACCAATACGAGACTGACACAGGATAAGCAGAAACACAGAGAGAACTTGCTAACAATATTGAAATATTCAAACGCTACACCCTTCAAGGACAAGTCTCTGCTCGGCTTCATCTGTGGATACTGTAACGCTTCGTACCCCGATCCAACGGATTTGAAGAATCACACGGATTTGGATCATATCAAGGAGCGTCTGGACTTCAAGTCCTCGTTCGACATGACAGAGTACAACGTGAAGCTGGACGTGGTCAACTTGATGTGCACGTTGTGCGGCGAGAGAATGGAGAATCTGTACAAGCTAAGAGATCATTTGATCAAGACCCACAACAAGGTCTTCCACAGAGACATAAAAGACCACCTGCTGCAGTTCAAGCTAAAGAAGGGCGATGTTTTCGACTGTGCGCTGTGCCCCTCGACGTACGAGACTTTCAAGATGCTGAAGCAGCACATGAACAAGCATTACTGCAACTACAGCTGCAGCAAATGCGAGAACTCGTTCGCAACCAAACGCTCGCTGAACACCCACCAGACTACACACGAGGAGGGTAGTTTCAAATGCGACCATTGTGACAAAATATTCTCGACTAAAACAAAAAAGCAGTATCACGAGAAGACCAAGCATTTGGGTGCTAGGAATATAAGTAACTGCCCCTACTGTGACGTGCCTTTCAGGAGTTACTATCAGAGAAACCAACACCTGGTGAAAGTTCACAACTCTGAGGCCCAATACAAATGCAACGTGTGCAGCAAGGGTTACATACTGAAATCACTCCTGATGTGTCACATAAAGAAGAATCACTTGATGGAGAGGAACTGCCAATGCACGGAGTGCGGCTACAAGTTCTTCAGCAAGAAAGCCCTCAAGGCGCATATGATAAAACACAGCGGTGAGAGGAAATTTATATGCGAAGTGTGCCACAAGTCGTATGCAAGGAAGTACACGCTGAGAGAGCATATGCGGATCCACAATGATGATAGGCGGTTCAAATGTGATATTTGCGGTACGGCCTTCATACAGAAATGCAGTCTGAAGTCACACCTACTGTCCCATCACGGTATTAGTTTAGCTGCTAGCGATATACCTATATCATGA

Protein sequence:

>DPOGS205243-PA
MPMLSDKKLPLVKSEPGDTDDNDSDNYFLASTIKNESPAPIQKIVRKVNIKSKKKERIEKKFKNLVNKLAINSCEKKKMKKDIKVMKGRKLTIKKDIEINVTFTNTRLTQDKQKHRENLLTILKYSNATPFKDKSLLGFICGYCNASYPDPTDLKNHTDLDHIKERLDFKSSFDMTEYNVKLDVVNLMCTLCGERMENLYKLRDHLIKTHNKVFHRDIKDHLLQFKLKKGDVFDCALCPSTYETFKMLKQHMNKHYCNYSCSKCENSFATKRSLNTHQTTHEEGSFKCDHCDKIFSTKTKKQYHEKTKHLGARNISNCPYCDVPFRSYYQRNQHLVKVHNSEAQYKCNVCSKGYILKSLLMCHIKKNHLMERNCQCTECGYKFFSKKALKAHMIKHSGERKFICEVCHKSYARKYTLREHMRIHNDDRRFKCDICGTAFIQKCSLKSHLLSHHGISLAASDIPIS-