Monarch geneset OGS2.0

DPOGS211230
Transcript: DPOGS211230-TA, 1443 bp
Protein: DPOGS211230-PA, 480 aa
Genomic position: DPSCF300007 + 1438562-1441565
RNAseq coverage: 366x (Rank: top 32%)
Annotation
Heliconius: HMEL009380, E-value 0.0, identity 77.92%
Bombyx: BGIBMGA003209-TA, E-value 5e-104, identity 68.75%
Drosophila: CG10492-PA, E-value 6e-29, identity 43.71%
EBI UniRef50: UniRef50_E2AJ51, E-value 2e-37, identity 46.15%, Zinc finger CCHC domain-containing protein 2 n=1 Tax=Camponotus floridanus RepID=E2AJ51_CAMFO
NCBI RefSeq: XP_001846486.1, E-value 1e-33, identity 50.77%, conserved hypothetical protein [Culex quinquefasciatus]
NCBI nr blastp: gi|307177318, E-value 8e-37, identity 46.15%, Zinc finger CCHC domain-containing protein 2 [Camponotus floridanus]
NCBI nr blastx: gi|91082885, E-value 2e-39, identity 29.93%, PREDICTED: similar to AGAP009083-PA [Tribolium castaneum]
Group
KEGG pathway:
Orthology group: MCL24863 (Lepidoptera specific)
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211230-TA
ATGTGCTTACCATTTGAACTACGCTTTATTGGCACGTATCTGGAGGAACTCGGTAAGCGAGACTTTCAGGAGTTGCGTGGTGCCGAACTGAGAGCCAATAACCCTACAGATCTGGCTGCTGATATCGGAGGAGCGGATACAGATTCTAGAACTCGACGGAAAATGGCTTTATACGTTTCATTATTGCGGTCTTGTAATTTTGCTTGTGCAACCACATTGTATAATGCCATGGTCAGTTTAGAACAAAGTGGTCTTTTGAAAGGCTTGTATGGTGATCTCCTTGAAGAGATTCTTTTACTGTATACTATGGCGTTACACCATCCAGCATTCACATTCGATCAGAAGTCACATTTTGGCGATATTTTAGAGAAGCTTAAGATAGAAGATCAGAGGATACAGTATCAAGAGCAGCAAGAGCGTCTAAACTTAGCTCCTGTTAGTGGTTGTCATACACAGGCTAGTATCACCCCAAATATGTCAGTACCACCACCAAACTTGCCAAGCTTGGGACCACCGCCTGGTATTATGTTTGTTAAAACATCTGGTGTGCAGCCACCGAGTTCTGATGCTATCGCAGAGCTCAGTTCCCCTCCTGTGTTGACTGGAGCTGGTGGTGTGGTTCCTATGCTCGGGGAAGTACCACACCCCCCTCCAGGTCTACCAATACCGCAGTACAACATAAATGAATTCATAGGCCAGCACACTTGGCCGGCCAACATAATTGTTTCACCTCTGAATCAACCTCTTGAGGTGTTGAACTTCCAGACTGCTGGAGGTGTCGCACCTTCTTCACCGCTGGTGTCTTCTCCAGGTGACAGCCGTGCAGCATCTCCACGAAATCGCCGGCGTCCATCCCGTGACCGTTCACCCCCACTAGTGCCCATTCCGCCTGAATCTATGCCTCATCTCACCACAAATTTCGAGAATCTTGCTGTAGCTGATATGAGACATAACATTGGGGAGGAAAGACTTCGTGAAATCACACACCATCAGTACCGGTCGTTGGAGAAATTGAACGGTGTGCGGAGGCGAGCTGGGGCTTACTGTCCACCTACACGTTCATCGTCCGACAGTGGTTCCAGCAATGGGTCTTCCCCCCCTAGTACACCGGCAGCACCACGACGTGTGACGCCGTCTGCGCCGCCGCCTGTACCATATTTACCTTACCCTCGCCCGTACCCCTACCCCACCCCACCTCCCGCCTACCGCCCGCCCCCACCGCCATTCGCGAACGGGGAACCCCCTCCGTACCCCGCACCTGCCCCATACGCCGGTTACATGCCTGTGTTGTACGCGCCACCGAAGTTATCCTGTTGGAATTGCGGCGCGTCTGGGCACGCTGGTCATGAATGCAAGGAGCCTAGTCTCGAAGAGATGACGCGGGCTGGCGGATATCAGCTCGACTTCGGCGGAGCTCCACCACCTGAAGCGGGGGACAAGTAG

Protein sequence:

>DPOGS211230-PA
MCLPFELRFIGTYLEELGKRDFQELRGAELRANNPTDLAADIGGADTDSRTRRKMALYVSLLRSCNFACATTLYNAMVSLEQSGLLKGLYGDLLEEILLLYTMALHHPAFTFDQKSHFGDILEKLKIEDQRIQYQEQQERLNLAPVSGCHTQASITPNMSVPPPNLPSLGPPPGIMFVKTSGVQPPSSDAIAELSSPPVLTGAGGVVPMLGEVPHPPPGLPIPQYNINEFIGQHTWPANIIVSPLNQPLEVLNFQTAGGVAPSSPLVSSPGDSRAASPRNRRRPSRDRSPPLVPIPPESMPHLTTNFENLAVADMRHNIGEERLREITHHQYRSLEKLNGVRRRAGAYCPPTRSSSDSGSSNGSSPPSTPAAPRRVTPSAPPPVPYLPYPRPYPYPTPPPAYRPPPPPFANGEPPPYPAPAPYAGYMPVLYAPPKLSCWNCGASGHAGHECKEPSLEEMTRAGGYQLDFGGAPPPEAGDK-
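
The relationship between the two sequences above can be checked with a short translation sketch. It assumes that the transcript record is a complete coding sequence read in frame 1 under the standard genetic code, and that the trailing "-" in the protein record marks the stop codon. The names `translate` and `CODON_TABLE` are illustrative and do not come from the database. Only the first 30 and last 12 nucleotides of DPOGS211230-TA are used as input here.

```python
from itertools import product

# Standard genetic code, built in TCAG order (assumed to apply to this nuclear gene).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence in frame 1.

    Stop codons are written as `stop_symbol` ("-" matches the protein record
    above); codons containing non-ACGT characters are written as "X".
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First 30 and last 12 nucleotides of DPOGS211230-TA, copied from the record.
first_30 = "ATGTGCTTACCATTTGAACTACGCTTTATT"
last_12 = "GGGGACAAGTAG"

print(translate(first_30))  # MCLPFELRFI
print(translate(last_12))   # GDK-

# Length check: 480 residues plus one stop codon account for the 1443 bp.
print((480 + 1) * 3)        # 1443
```

Passing the full nucleotide sequence to `translate` should, under the same assumptions, reproduce the DPOGS211230-PA record including its final "-".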