Monarch geneset OGS2.0

DPOGS205035
TranscriptDPOGS205035-TA1734 bp
ProteinDPOGS205035-PA577 aa
Genomic positionDPSCF300388 - 67312-78480
RNAseq coverage94x (Rank: top 62%)
Annotation
HeliconiusHMEL0225234e-10253.24% 
BombyxBGIBMGA001784-TA2e-2656.70% 
Drosophilasu(Hw)-PB7e-1626.57% 
EBI UniRef50UniRef50_D6WMT62e-1926.36%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WMT6_TRICA
NCBI RefSeqXP_967201.14e-2026.36%PREDICTED: similar to zinc finger protein 99 [Tribolium castaneum]
NCBI nr blastpgi|910925248e-1926.36%PREDICTED: similar to zinc finger protein 99 [Tribolium castaneum]
NCBI nr blastxgi|3660399611e-2528.96%zinc-finger protein 80-like [Mus musculus]
Group
Gene OntologyGO:00036768.3e-05nucleic acid binding
KEGG pathway 
Orthology groupMCL34575 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205035-TA
ATGTCATGTCTGTGTGTTGGGCGATGTATCCAAGAGGTTAAGGAAGAAAGATTAAAGCAGTATTATTTAGATTTGCTCAGAGAAATTCCTTTGAATGTAGACCTTCCATCCCCATGGCTCTGTTGGGAGTGTGTATCCCTGTTACAAAGAGTGGTGGCGTTCAGAGATCAAGTGAAAGACTCGTATAGGATACTACAAACTTATACTAAGGAGAATTTCAATGAATGCCTGCAAAGCGATGTGTCGAGATCTCCACGGTTGAAGTTGGCCAAGCAATTGTGTATTGATATACCGCCTGAAAATGTGAAGTTTGCAATTGACGAAGATGAGTTGACTCCGAGGAATAAGAGTTTTAATATAGACTTGGAACATGAACTCGAAGAGGTCCACACGGTCATATGTGATGTACAGCGAGGACAGGGGAAGATCACAAACAACGGAGGTCTGTTCTTCAACGAGGATGATCCGATGTGTACCCAACACGACGTCAAGAACGAGCCGTGCGACGATGTATTTACTGAGAAAATGAAAATAGAAGCCGTCGATGAATCGAATTTGACCCTGAGGAGAGAGAGAAAGAAGTTAAATCATAAAATAAAGCGACCGGAAGTTAAAGATTACTGTGAGAACGGAAGTAATGAAATAATTGAGGTCAAAATTGAGAATTATGAAGGCAAGGTTTCATTTAGGAGAGAAGACATCTCGGAGAAGAACAAAAAAGATGTCAAGATAAGTATTGATGAAAACAAGATGGAAGGAGATAACGAAGACACGAAGGACAAGGTTACAGACACTAAGTGCGTTAAGAATCAATACTACAAAACTGTGCATCTCTCCTACGAGGAGATGCAAGCGGAGAGACAGAAACTGCGCTGCGCGGAGAGTTTCTTGAGCTCTCCGTATAAATGCGAGTCTTGCATACTGGTTTACAACAACCAGAGGTCGCTGAAGATTCATAGAGAAAAGAGACATTCAGTCACAGGTAAATATACCTGTTCCATTTGCGACATAAACGTATCATCAGCTGATGAATTCACGTCACACTACAGGCGGCACATGAGGACTTCAGCGGCAACACACAGGTACCACAAGGAAAAGCATCACTCTAACAAACCGAGGATAGAGTGCGCCGATTGTGATAAAACTTTCAGTCATCGAGCCGGGCTAATGAATCACAGGTTGACTTTTCACGAGTATCAGAACAAGTTCCCCTGCAACGTGTGCAACAAGATATTCAGGTGGAAAACTAGTTTAAAACGGCATTTGAAGAAACATAATGAGTCTAAGGATAACCGCAGTAAAGCCTTCTGTGCTAAATGCGACATCGTGTTCTCGTCTGTGTGTTCGTTGCAGCGTCACCTCAGGAACAGTTTGAAACATGTCACCAGCGATCAATTGAAGTTCATCTGCGACCACTGCAACCATAGATTTGCTGACAAGACGAAGCTGAGAGACCACATTGAAGAGAAGCACCTGTTCAGGACCTACCAATGCCATATCTGCCATAAGCCGTCCAAGAACAGGGTCGGACTGGAACAGCACATACGAACAGTGCATAAGGGAAGGCCGAATAACAGGATGTGTCACCACTGTGGCAAAGGGTTCCCCGTACACCTTAACATAAAATCGAAGAGGTATCCTATGTGCAGGAAACGGAAGGAGAACAATCAACCGGATGTGGCCGTCTTCACACTTCCGGTCCACTTTATGCCGGATAACGGATATGCTATATAA

Protein sequence:

>DPOGS205035-PA
MSCLCVGRCIQEVKEERLKQYYLDLLREIPLNVDLPSPWLCWECVSLLQRVVAFRDQVKDSYRILQTYTKENFNECLQSDVSRSPRLKLAKQLCIDIPPENVKFAIDEDELTPRNKSFNIDLEHELEEVHTVICDVQRGQGKITNNGGLFFNEDDPMCTQHDVKNEPCDDVFTEKMKIEAVDESNLTLRRERKKLNHKIKRPEVKDYCENGSNEIIEVKIENYEGKVSFRREDISEKNKKDVKISIDENKMEGDNEDTKDKVTDTKCVKNQYYKTVHLSYEEMQAERQKLRCAESFLSSPYKCESCILVYNNQRSLKIHREKRHSVTGKYTCSICDINVSSADEFTSHYRRHMRTSAATHRYHKEKHHSNKPRIECADCDKTFSHRAGLMNHRLTFHEYQNKFPCNVCNKIFRWKTSLKRHLKKHNESKDNRSKAFCAKCDIVFSSVCSLQRHLRNSLKHVTSDQLKFICDHCNHRFADKTKLRDHIEEKHLFRTYQCHICHKPSKNRVGLEQHIRTVHKGRPNNRMCHHCGKGFPVHLNIKSKRYPMCRKRKENNQPDVAVFTLPVHFMPDNGYAI-