Monarch geneset OGS2.0

DPOGS208604
TranscriptDPOGS208604-TA1740 bp
ProteinDPOGS208604-PA579 aa
Genomic positionDPSCF300052 - 5558-22449
RNAseq coverage148x (Rank: top 54%)
Annotation
HeliconiusHMEL0092270.086.68% 
BombyxBGIBMGA013385-TA2e-15984.17% 
DrosophilaLin29-PC2e-13861.67% 
EBI UniRef50UniRef50_UPI00020641329e-14572.75%UPI0002064132 related cluster n=2 Tax=unknown RepID=UPI0002064132
NCBI RefSeqNP_726568.22e-13963.17%CG2052, isoform B [Drosophila melanogaster]
NCBI nr blastpgi|3287861013e-14472.75%PREDICTED: hypothetical protein LOC100576251 [Apis mellifera]
NCBI nr blastxgi|3287861011e-15071.03%PREDICTED: hypothetical protein LOC100576251 [Apis mellifera]
Group
Gene OntologyGO:00036768.2e-14nucleic acid binding
GO:00082702.2e-05zinc ion binding
GO:00056222.2e-05intracellular
KEGG pathway 
InterPro domain[92-126] IPR0130878.2e-14Zinc finger, C2H2-type/integrase, DNA-binding
Orthology groupMCL16014 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208604-TA
ATGAAGTGTGATTTGTGGGGATATAACATGTTCGATAATGCATCAATACTATCACAGATCTTCATGCACAAGAACGCCATGCAAATGCATGCACGCGAGCTGCAGCGCGGCCTCGGTGGTGAGGTGAAGCCGCACCAGTGCCAGCAGTGCTTGAAGTCGTTCAGCTCGAACCACCAGCTCGTCCAGCATATCAGGGTCCACACGGGCGAGAAGCCGTACAAGTGTTCCTATTGTGATCGACGGTTCAAGCAGCTCAGTCACGTTCAGCAGCACACCAGATTACACACAGGGGAGCGTCCGTATAAATGTCATTTGCCTGATTGTGGCCGTGCCTTCATCCAGCTGTCGAACCTCCAGCAGCATCTGAGGAACCATGACGCTCAGGTGGAGAGAGCAAAAAATAGACCATTCCATTGTAATATCTGTGGGAAAGGATTCGCGACTGAGAGCAGCTTGCGTACTCATACAGCCAAGCAACACGCGGCGCTCATGATAGGGGGCGCTACGGCCACCCCGTGTCCTATATGCCATAAAGTTGTTTTCGGGGGCGAAGCCCTAGTAGAACATATGAAGAACACGCATAAGGACCCGAACGCATCCGGTGTTGCGAGTCCGCCGGCCACGTCACCGTATCCTAAACTGGACCCGTATGTAGCGAAGCGGCGTACGGCGAACCACCCGTGTCCTGTCTGCGGGAAGCACTACGTGAACGAGGGTTCGCTGAGGAAGCACCTCGCCTGCCACCCAGAGACACAGCTCACCAGCGGACTGAGGATGTGGCCCTGCTCCGTCTGTCAGGCCGTGTTCACACATGAGAGCGGTCTTCTATCCCATATGGAGCACATGCGGATGGAGCCTAAGCATCAGTTCGCTGCTCAGTACGTGCTGTCACGAGCGGCGGCCGAGAGACGGGAGAGAGACCTCATTGCCGCTGTATCATCAGCTGGGGGCTCTGGACTCTTGAACCTGGCACCCCCATCACCAGCACACTCCGACTCATCATCCAACGGACGGCTCTCATCATCCGCCGGATCTGACGCCGGCGCCGCCGTCAACAAACTATCAGACCTCCTCCGCGCCAACAACGGCCAGTACGGAGACGAGAGGGTCGCGGCCATAGCAGCGGCCGCCGCCAACATGATGAGCCAACCAGGTGAGAGCAACAACGCTGTACAGGTAGCCGCTGCCAATCTAGTAACGGCCATGAGACAACAGCTGGCGAGGGAACCGCAACCGGAAACACCTCCAGCTCAGGCGGAAGCGGCGCTCAGGATACAACAAGCTGAAGCTCTGCTACGGAGTCAGGCGGAAGCGTTACGGTTGGCCGTGTCACAGGCCGCCGCCGCTCATAACACCGAAACACCCAGCCCCTTGCGACATAACGGGGGATTCCCCCAACCGAACGACGCCAGCGGACAACTATCACCTGAATTGGTCGAAGCCTTCAGAATCGCACAGGAACAGAGACTCGAACAAGCGCTGCGACTTCACGACCCGAGAATGTTGGGCTTCAACATACCGTCGCCGGTACAGGCGGCGCAACAAGTCGCCGCTCAGGCTCAGGCTCAAGCCCAAGCCGCCCAAGCCGCCCAAGCCGCTCAAGCCGCTCAGGCAGCGCAGGCCGCTCAACAAGTCGTACAGCAGCAACAAATTCAAGCGGCACAAGCAGCACAAGCGGCACAAGCGGCTCAACAAGCTGCACAACAAGCTGTCCACCTCCAACAGAACCCACAACCATGA

Protein sequence:

>DPOGS208604-PA
MKCDLWGYNMFDNASILSQIFMHKNAMQMHARELQRGLGGEVKPHQCQQCLKSFSSNHQLVQHIRVHTGEKPYKCSYCDRRFKQLSHVQQHTRLHTGERPYKCHLPDCGRAFIQLSNLQQHLRNHDAQVERAKNRPFHCNICGKGFATESSLRTHTAKQHAALMIGGATATPCPICHKVVFGGEALVEHMKNTHKDPNASGVASPPATSPYPKLDPYVAKRRTANHPCPVCGKHYVNEGSLRKHLACHPETQLTSGLRMWPCSVCQAVFTHESGLLSHMEHMRMEPKHQFAAQYVLSRAAAERRERDLIAAVSSAGGSGLLNLAPPSPAHSDSSSNGRLSSSAGSDAGAAVNKLSDLLRANNGQYGDERVAAIAAAAANMMSQPGESNNAVQVAAANLVTAMRQQLAREPQPETPPAQAEAALRIQQAEALLRSQAEALRLAVSQAAAAHNTETPSPLRHNGGFPQPNDASGQLSPELVEAFRIAQEQRLEQALRLHDPRMLGFNIPSPVQAAQQVAAQAQAQAQAAQAAQAAQAAQAAQAAQQVVQQQQIQAAQAAQAAQAAQQAAQQAVHLQQNPQP-