Monarch geneset OGS2.0

DPOGS204904
TranscriptDPOGS204904-TA1728 bp
ProteinDPOGS204904-PA575 aa
Genomic positionDPSCF300340 - 157125-173074
RNAseq coverage6x (Rank: top 87%)
Annotation
HeliconiusHMEL0107342e-7139.83% 
BombyxBGIBMGA001777-TA8e-3927.43% 
DrosophilaCG6654-PA2e-2627.89% 
EBI UniRef50UniRef50_G3X9964e-3929.85%RIKEN cDNA 2210010B09, isoform CRA_a n=5 Tax=Murinae RepID=G3X996_MOUSE
NCBI RefSeqXP_002739995.11e-3428.53%PREDICTED: zinc finger protein 347-like [Saccoglossus kowalevskii]
NCBI nr blastpgi|1973848646e-3930.10%zinc finger protein 426-like 2 [Rattus norvegicus]
NCBI nr blastxgi|2607893652e-4631.73%hypothetical protein BRAFLDRAFT_100841 [Branchiostoma floridae]
Group
Gene OntologyGO:00036767e-09nucleic acid binding
KEGG pathway 
InterPro domain[512-538] IPR0130877e-09Zinc finger, C2H2-type/integrase, DNA-binding
Orthology groupMCL23304 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204904-TA
ATGTCTGGTTTCATGCAATTAATCTTACCCGACTATACAAAAGAGGTTCCGCAGGCAAAATGCTGTATTGCTTGCCTTAACAAAAATGTGATTTCTATCGATTTAAACACTTGTAAACATGCTGCAATCATTGAATATTTATTAGATTATAAGCCAAACTTAAAAGAGGCCATTGTATGTTTAAACTGTCACAGTCTGATAAAGAAGATAGAGAAATTCAGGGTTGAGTTGACGGAGAATATGAAATTATTTGAAGAGTCTCAAACAAATATTCTTAATGTCAATCGAAGCGGTCTTGGTTATTCGAATATCGAAATCTATTCAACGATCGACTCCCAGCCAATAGAAATGGAGATAAAATCGACCAATCAGGTGTCGCCGATCGAAATTATGACAGAAGTCAAAATCGAATATGACAGCTCCAATTCGGAACACGTCCCATTGGTGGAGATTCAAAAAAGGAAACCCGTCAAAAAGCCACAGAATATAAAGAAGAATAAGAAAGAATTAAAAAATAAGCCTGTCAAGAGAAATGTGTCGTCGCTACAGAAACAGTACGAGGGTAAAATTCGGATAGTCGTACTGAGTCAAGACGAAATGTTGGAGGAGAGACAAGTTGAGGCCAAGAAGCCGAGTTATCTGAGACTTCCATACAAATGTGATCTCTGTATAACGGCTTTCGACCACGAGTTGACTTTGAAGAGTCATATAGAGTCTAGACATAATAAATCTGGTGAATACGAGTGCTGTGTTTGCAAGTCTCACCTCTCCACTAAAATATCATTTGACGAGCATTACAAGAGACATTTCAGACGCACGGTTCACGGCCAGTCGAGTCGCGTGTACGGCTGTGATAAGTGCAACAAGGTGTACAAGGCTAAGTCCGGTCTGAGCGCGCACATCGCGACGCACTCCTCGCCGGTCTACTGCAGGGACTGCGACACGCACTTCAGGACACCGCACGGACTCAGACACCATCTCAAGACTCACTCTAGACACGTGGAGGATAATGATAAGAGGTTCGTGTGCAAAGATTGTGATCTGAAGTTCCTAACGCCGAAGTCCCTGAGGGAGCACGTGGATTGGGTTCACTTGAACGACACGAAATACGAGTGTGACTCGTGCTCTAAGGTGTTCAAGAATAAGAACAGCCTGAAGAAACATTTTCAATACGTGCACGAAAAGAAGAGACCTCCGAGGAATAAGATCTGTGATCACTGCGGCAGAGCATTCACTTCGCGTATGATATCCGTCTCCGAGTTCCCAAACCTCTTTGTGTTAAGGTTCGTGTGCAAAGATTGTGATCTGAAGTTCCTAACGCCGAAGTCCCTGAGGGAGCACGTGGATTGGGTTCACTTGAACGACACGAAATACGAGTGTGACTCGTGCTCTAAGGTGTTCAAGAATAAGAACAGCCTGAAGAAACATTTTCAATACGTGCACGAAAAGAAGAGACCTCCGAGGAATAAGATCTGTGATCACTGCGGCAGAGCATTCACTACGCTACAAATCCTCCGATCCCATATCCACACTCACACGGGCGAGCGGCCGCACCGCTGCGACGTCTGTGGCGCCTCGTTCGCTCACAAGGGGGCGCTTTACACACACAATAAGGCGAAGGGCGACCTCGTGTGTGAGCCGTGTAACAGAACCTTCTCCTCGATAGCCACGTACCAGCAACACATGAAGATCAGCAAGAAACATGCGACCGAAAACGATTTCAAGTGA

Protein sequence:

>DPOGS204904-PA
MSGFMQLILPDYTKEVPQAKCCIACLNKNVISIDLNTCKHAAIIEYLLDYKPNLKEAIVCLNCHSLIKKIEKFRVELTENMKLFEESQTNILNVNRSGLGYSNIEIYSTIDSQPIEMEIKSTNQVSPIEIMTEVKIEYDSSNSEHVPLVEIQKRKPVKKPQNIKKNKKELKNKPVKRNVSSLQKQYEGKIRIVVLSQDEMLEERQVEAKKPSYLRLPYKCDLCITAFDHELTLKSHIESRHNKSGEYECCVCKSHLSTKISFDEHYKRHFRRTVHGQSSRVYGCDKCNKVYKAKSGLSAHIATHSSPVYCRDCDTHFRTPHGLRHHLKTHSRHVEDNDKRFVCKDCDLKFLTPKSLREHVDWVHLNDTKYECDSCSKVFKNKNSLKKHFQYVHEKKRPPRNKICDHCGRAFTSRMISVSEFPNLFVLRFVCKDCDLKFLTPKSLREHVDWVHLNDTKYECDSCSKVFKNKNSLKKHFQYVHEKKRPPRNKICDHCGRAFTTLQILRSHIHTHTGERPHRCDVCGASFAHKGALYTHNKAKGDLVCEPCNRTFSSIATYQQHMKISKKHATENDFK-