Monarch geneset OGS2.0

DPOGS203674
TranscriptDPOGS203674-TA2058 bp
ProteinDPOGS203674-PA685 aa
Genomic positionDPSCF300010 - 2280870-2285283
RNAseq coverage1331x (Rank: top 10%)
Annotation
HeliconiusHMEL0133370.081.68% 
BombyxBGIBMGA003468-TA0.060.73% 
DrosophilaCG8108-PB2e-4332.15% 
EBI UniRef50UniRef50_F4WXI28e-8140.62%Zinc finger protein on ecdysone puffs n=8 Tax=Pancrustacea RepID=F4WXI2_ACREC
NCBI RefSeqXP_001121361.19e-8143.46%PREDICTED: similar to CG8108-PB, isoform B, partial [Apis mellifera]
NCBI nr blastpgi|3287842386e-8142.33%PREDICTED: hypothetical protein LOC725524 [Apis mellifera]
NCBI nr blastxgi|3072057242e-8637.60%Zinc finger protein on ecdysone puffs [Harpegnathos saltator]
Group
KEGG pathway 
Orthology groupMCL18818 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203674-TA
ATGGAGGTCGGGGTAACTACAGCGGCCGTGGAAGTGGTTATAGAGGATCATTTCGTGGTAACGGGAGTCGCGGCGGATATGATGGCGGCCGTGGGGGCCGAGGAGGCGGCTACTCAACCTATAACGAAAATCGATACAACAGTAACAATGCCAACAGATATTCATCAAGTCGGGATCGTTGAAAGTTCAGCTAGTTATTCAAATCGTGACTATGGTGGTCGATCAGGCTCTCCAGAACGCAAACGGATGAGAATGGAGCATAGGAGCGATGCAAACAGAATTATCTACAGGGCTCATCGAGCGATAGACGTAGCCACGATGGCGGCAGTCACTATGGCGGCTCGTACGGCGGTAGACAGGAGGGTTACGGCGAGCGGCGGTCGTTCGCGAGCGAGGACAGGCGGCGGTCGCCGGCTCGAGAGTATCGCAAGCCCAGTGGCATGGGGCCGCCGCGAGAGCCGCTCCGAGCGCTCGTCCGGCCGCGGGCTGCGCGACGCTCATTCCGCGGACGAACATTGCGCACTCGCCCCCTCTATCGGGGAGCCCCCCGTTCCCGTGGATCCTTCTCCTCTAGGCGATTTGCTGAAAGATCGCTGGGGTACACCCGCACATTTAGAACTACTAAGGGGCGAAGGTAATGCTAAATGCAAAATAATGTTTAAGTTAGTTTCCGATCCCATGGAAGGTGAGCCTGATTCACAAAGCTCTGTTAAATCAAAAGAAGATGAAGCATCTTCCACAGAGGAAGATTGGGAAGCTGATGAGAAAGAGGAAGTCATAGAAGAGAAGAAAGAAACCAAGAACAAGTCGCCAGAAGTCAGTGCACCAGAAGTCGAGGGGTCAGAAGGTGAGGCAGGTGAAGGCGGAGAGGATACTGATAAAGAACCAGATGCTGCGTCAGATACGGCTCCATCGCGTCCCTATGTTCATCTTGCCTGTGTTCACTGTAAAGAGAAATGTGTTACTTTTGGAAGTTACACCAAACACCTTTTGTCGAGTAAGCATCGTGCCGCTATGAGTTCAGTGGCTCGTCGCCATAAGCTAGAGTTGCTACGTATGCGTGTAGCTCAGCGCGGCGCGCAACGTGACCTGGAGGCTGCGGCAGGCGCCGAGCTGGCGGCCCGTACCACTTTCTGTCTTGTGTGCCGCCTCAACCACCGTACCACGAGACACGCGCATAACCTCACCGACACTCACCGCGCCATGAAACGACTTCTGATGCCATTCTGCCGCATCTGTCGTATCACTTTCCGCTCACCCATGATTTACGAACACCATATTTGTTCCGTGGAACATCTTAAGAAAAAGGCCAGTCTTAACGCTCGACGGGCGAGCCCAAAGGCTGAAGCTAGTGCTGATGAGGGTATGGATGTGGATTTGGATAACTTCATGACGTTGGACTCTGTGGGTGATGTTGATGAAGTTGAAGATGATGACTCCGGCGGTGAGAAAAAAGATGAATCTGCCCCAAAAAAAACAAAAGTTGAGATAAATATTGGTAGCGAGCATATTAAGAAGTTAGAGGTTCACTGGTGCGAGCTATGTCGCGTGTATTTGCCGCGTGTGGAAGCTGGTAGTGCTGAGGAGGCGGAAGCTCTTCGCCGTCACTGCCGTCTGCGTGTCCACCTCGGTCGGTATGTGCAGCACCGGGACACGCGCACACTACGGCGCCACGCAGAGAGAATACACCGCCAGCTACACCAACAAAAGGAAGATGAAAAAGAAGTTGCCGCTTCTGAAGAAGTAGCCGATAAGGAAAAAATTGAAAAAAAGGAACCTTCTGTTGAAAACGCAAATTTGGAAAATGGAGCTGATCTGTCAAATATTTCTGGAAGCGAAGATAAATTGTGGGCTGATGTGGATAAGGATATTGGCGAGTTATTAAGAGAAGTGGATCCTCAGGGAAATGAAGCTAGTGACGATGACGAAGACCTTGGAAGGTATGATAAATTTCGTAAAAGTGATAAAAAACCAAAGGCTGATTTAGAAGAAGGTGAAGATGCTAAAGAAGAAATCTCCAATGAAAAAGCAAATGTAGAAGTAAAAACATCTATTTGA

Protein sequence:

>DPOGS203674-PA
MEVGVTTAAVEVVIEDHFVVTGVAADMMAAVGAEEAATQPITKIDTTVTMPTDIHQVGIVESSASYSNRDYGGRSGSPERKRMRMEHRSDANRIIYRAHRAIDVATMAAVTMAARTAVDRRVTASGGRSRARTGGGRRLESIASPVAWGRRESRSERSSGRGLRDAHSADEHCALAPSIGEPPVPVDPSPLGDLLKDRWGTPAHLELLRGEGNAKCKIMFKLVSDPMEGEPDSQSSVKSKEDEASSTEEDWEADEKEEVIEEKKETKNKSPEVSAPEVEGSEGEAGEGGEDTDKEPDAASDTAPSRPYVHLACVHCKEKCVTFGSYTKHLLSSKHRAAMSSVARRHKLELLRMRVAQRGAQRDLEAAAGAELAARTTFCLVCRLNHRTTRHAHNLTDTHRAMKRLLMPFCRICRITFRSPMIYEHHICSVEHLKKKASLNARRASPKAEASADEGMDVDLDNFMTLDSVGDVDEVEDDDSGGEKKDESAPKKTKVEINIGSEHIKKLEVHWCELCRVYLPRVEAGSAEEAEALRRHCRLRVHLGRYVQHRDTRTLRRHAERIHRQLHQQKEDEKEVAASEEVADKEKIEKKEPSVENANLENGADLSNISGSEDKLWADVDKDIGELLREVDPQGNEASDDDEDLGRYDKFRKSDKKPKADLEEGEDAKEEISNEKANVEVKTSI-