Monarch geneset OGS2.0

DPOGS203942
TranscriptDPOGS203942-TA1293 bp
ProteinDPOGS203942-PA430 aa
Genomic positionDPSCF300005 - 80893-82576
RNAseq coverage16x (Rank: top 81%)
Annotation
HeliconiusHMEL0120812e-16972.24% 
BombyxBGIBMGA012275-TA2e-13260.47% 
DrosophilaCG17385-PA9e-3138.32% 
EBI UniRef50UniRef50_E9Q2S71e-3338.32%Uncharacterized protein n=5 Tax=Murinae RepID=E9Q2S7_MOUSE
NCBI RefSeqXP_001945749.11e-3238.61%PREDICTED: similar to mCG7830 [Acyrthosiphon pisum]
NCBI nr blastpgi|2030974044e-3338.32%zinc finger protein 426-like [Rattus norvegicus]
NCBI nr blastxgi|3266672554e-3738.96%PREDICTED: zinc finger protein 91-like [Danio rerio]
Group
Gene OntologyGO:00036762.6e-13nucleic acid binding
GO:00082707.6e-05zinc ion binding
GO:00056227.6e-05intracellular
KEGG pathway 
InterPro domain[280-306] IPR0130872.6e-13Zinc finger, C2H2-type/integrase, DNA-binding
Orthology groupMCL25941 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203942-TA
ATGGAAGATATCCAGAATGTTTTAATTACAAATTCGGAAGGTGTTTTAAATAAGGATTATATTTCAGGCTTTATAGAGAAACCTGAACAGGATGGATCAGCTGTTTTTATTTGTCGCATTTGTAAGAAGACATTTTATAATGCTAACGCGCTTCAAAATCATAAGAATTTGCAACATGACGATATTGGTTCCCTTTCTGATGATGATAATATTTCTTATTCAGTCAATTCCAATGACCAGATTTATGAAGATCTCTGTGGTTTTAAATTCATCCCTTTAAAAAAAAACGAAGAAGATTTGATTATAGAGGAGGATGGCATTTCAAAACGCACGTTTAGGTGTGATACCAGCTACCTCATTGTGAAATATGAAAGCAATGAAAGTGTAGGTGTAAAGAGATCTCGTTTGGATAACATGATCAGAAAAATTCAGTCGAAGCCAGAAAAGAAACCAGTCGACTTGAGGGGACCATTTACGTGCACACTTCCCTCGACTTTGCGACCCAGTGTACAATGTCGCCAGATTTTTTTTAACTGTTGCGAATATTCGTTACATTACCGAGAAGAACATACCAAACGAAGGAAAGCCGCTTTACGCTGTCAAGTTTGCGAGAAACGTTTGGACAGAGACTTTTATCCAACGACCACTCTAACTGAGCAATTAAACACCCAGAACACTTCATCTTTCTCGTGTCGTATATGTGGGTTTACATTTTTAGATAGTACAGATTTCGATGAGCATAATCGCATAGTGCACGCAAAAATGAAACCACACCAGTGTAGTTTGTGCTCCAAACGCTTCACACAACTTGGTGGTCTCCAACAACACATGCGTATGCATACAGGAATACGTCCGTTCGTATGTAAATTCTGTTCGAAAGCATTTACGCAAAAGGCTGGTTTGGATCAACATTTACGAACTCACACTAAGGTAAAACCATTTAAGTGTATCATTTGTTGCAAATGCTTTTCGCAGTCAGTTCATTTACGTCAGCACATGCGAACACATACAAATATCCAGCCTTTTGGATGTCCTATATGCAATAGGAGATTTAAACAGAGTAGCCATTTAAACTTTCATATGCGTTCTCATGTTGGAGAAGCGAGTGCATTGATCATGGAACAATATGCTCAGGCTATGCAGCAACAAGGTCAAATGGACTTTCTCAATTTTTCAAAAGTGCAGCCAGTTCAGGATGGTGAAACCATTTACTATTCTGCAGAATTAGCTTCGATGCCTCCCGAAGCCGGTGCAGCTAATAGTAAATATTTCCTTTCTAATAAAGGCATCTGA

Protein sequence:

>DPOGS203942-PA
MEDIQNVLITNSEGVLNKDYISGFIEKPEQDGSAVFICRICKKTFYNANALQNHKNLQHDDIGSLSDDDNISYSVNSNDQIYEDLCGFKFIPLKKNEEDLIIEEDGISKRTFRCDTSYLIVKYESNESVGVKRSRLDNMIRKIQSKPEKKPVDLRGPFTCTLPSTLRPSVQCRQIFFNCCEYSLHYREEHTKRRKAALRCQVCEKRLDRDFYPTTTLTEQLNTQNTSSFSCRICGFTFLDSTDFDEHNRIVHAKMKPHQCSLCSKRFTQLGGLQQHMRMHTGIRPFVCKFCSKAFTQKAGLDQHLRTHTKVKPFKCIICCKCFSQSVHLRQHMRTHTNIQPFGCPICNRRFKQSSHLNFHMRSHVGEASALIMEQYAQAMQQQGQMDFLNFSKVQPVQDGETIYYSAELASMPPEAGAANSKYFLSNKGI-