Monarch geneset OGS2.0

DPOGS213639
Transcript: DPOGS213639-TA, 1935 bp
Protein: DPOGS213639-PA, 644 aa
Genomic position: DPSCF300165 - 132600-139107
RNAseq coverage: 2849x (Rank: top 4%)
Annotation
Heliconius: HMEL004586, 6e-75, 50.83%
Bombyx: BGIBMGA004581-TA, 8e-137, 64.91%
Drosophila: CG33521-PA, 2e-79, 34.44%
EBI UniRef50: UniRef50_D6WYU2, 2e-113, 40.78%, Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WYU2_TRICA
NCBI RefSeq: XP_001947503.1, 3e-124, 43.33%, PREDICTED: similar to CG33521 CG33521-PA [Acyrthosiphon pisum]
NCBI nr blastp: gi|328720278, 3e-123, 43.33%, PREDICTED: hypothetical protein LOC100161216 isoform 3 [Acyrthosiphon pisum]
NCBI nr blastx: gi|189239799, 6e-131, 40.35%, PREDICTED: similar to LIM domain protein [Tribolium castaneum]
Group
Gene Ontology: GO:0008270, 1.4e-16, zinc ion binding
KEGG pathway:
InterPro domain: [62-125] IPR001781, 1.4e-16, Zinc finger, LIM-type
Orthology group: MCL16823, Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213639-TA
ATGGCAACAACAATGGAGGCGTCTCAGTCTCTCATCCAGTCGGAGTCCAGAGTCAGTTACCAGGAGGTGCACGTCACGGAGAGGACCAAGAAGAAAAGCAAGACACGACGGCATAAGGACGAGGGATCTATATCAGTGTCCAAGAGCTCCGAGAAGCTGTTCAAGAAGATGAAGGCGGCAGAGGGAGACAACCCGACCTGCGCCAAGTGCGCCCGGCCCGTGTACGCCATGGAGAGGGTGAAGGCGGAGAGGCGCTCCTGGCACAGGGACTGCTTCAGATGTGTACAGTGTGACCGACAGCTCACCGTGGAGTCCTACGAGAGCGACCACACCGCCCTGTACTGTAAGCCACACTTCAAGCAGCTGTTCGAGCCCAAGCCGGTGGAGCACGACGAGCTGGACGCGGCTCCCAAGAAGCACCAGATGATTATATGTGAGAGCCAGCCGGTGGAACTTCCGCCGGACGTCGTCAGAGCATCAGACAAACCCGAGCTGGGCCTGGAGGAGCTGGCGGCGCTGGACGTGAAGTCCAGGTTCCAGGTGTTCGAGAAGAAGAACACGGAGGAGGTGAAGAAGGAAGACCTGGGACGAGCGCCCAAAGGGAAGAGCGCCGCCGTACTGGCGAAGATGGCCAAATTCAAAGCCAAAGGTATGGACATCGGCGTGTCGGAGGAGGCTCTGAACGGAGTGACCCTGGAACCCTCCTCCAGCGACCAGGAGGACGACGACGACGATGACTCGGTATTGAAGAAGTCGTACTCCCACAAGGCGATGGTGGAGGCGCCCTCGGTAGAACTGTCCGAGCTGGTGGGCAGGTTCGAGAGACCCAGACCCTCCAGGCATCAGCAGAGGAAGCAGGAGATACAGAACATTAGGAGCAGGCTGTTCCTGGGGAAACAAGCCAAAATAAAGGAGATGTATGAACAGTCGGTGCTACAGTGTGAACAGAGTGTGACGTCAGCGGACAAGATCGCCAAGGAGCTGGAGGTGGACGCGGAGAAGGCGCGGGCCATCAAGCAGAGGTTTGAGAACGGGGAGATATACAACGACGAGAACCGGCCGCCTAGGAACAGAGACCTCGACGACAGAGCGCTCTTCGACGACGGTATCGCCAAGAAGTCTCGTTCCATCTTCCTGGAGCTGGACGCCAACGCCAAGCACGCTTCGCCCGGAACCCCCGGCACCCCCACCGCCGAGCCTCCAAGGAGGAAGGAGATACCCTTCGTGCCCAAGGACGTGATCCGCGCCGCCGACAAGCTGGAGGACGTGAAGATAGAAACCTCCGACATACACGACAGGTTCCAGTTCTTTGAGACTTACAAGCCGCAGGTGACGCGCAAGGAGTTCCGGGTCACGCCGCCCAGAGAACCACAGCGCCCCAAGTCACCCTCGCCAGAGATCTACCACGAGCCGAGTGTGGTGAGGAGCGACGAGCCTCCCGCGGACGCCGGGCTGGCGGCCAGCAGGCACACGGCGTCCAAGATGATAGACGTGTTCCGGAAGATGGAACAGGACAGGAACAGACCCGACGACTGCCAGGGTCCGAAGCCCCTGAAACGCTTCACGCCCCCTCCCCCGGGAGAGTCCGAGCGACACGGGAACACCAGCACCTCCTCCGAGTACAGCGAGGACGAGGACAGCGACGACGAGGGGCTCAGGAGGTACCAGCAGGCCCGGGAGCAGGACGAGGCGCTCAGACAGGCCCAGCAGTTGGCGCGCACCAAAAGCTTCAGGGACAGATTCGAGAACTGGTCAGAGAGCGAGAGAGACGCCGAGCCACGCTCGCCCTCCGCTGACCCGAGACACCTGCTGGACGACGGACAGTCACAGCTGGAGACGGCTAAGAGCCTCCGGGAGAAGTTCGAGATGATGAAGATGCAGACGACCGTCACCAAGTCCGTCACGCCCAAGGTCAACAGATTTGTGGTAAGCTGA

Protein sequence:

>DPOGS213639-PA
MATTMEASQSLIQSESRVSYQEVHVTERTKKKSKTRRHKDEGSISVSKSSEKLFKKMKAAEGDNPTCAKCARPVYAMERVKAERRSWHRDCFRCVQCDRQLTVESYESDHTALYCKPHFKQLFEPKPVEHDELDAAPKKHQMIICESQPVELPPDVVRASDKPELGLEELAALDVKSRFQVFEKKNTEEVKKEDLGRAPKGKSAAVLAKMAKFKAKGMDIGVSEEALNGVTLEPSSSDQEDDDDDDSVLKKSYSHKAMVEAPSVELSELVGRFERPRPSRHQQRKQEIQNIRSRLFLGKQAKIKEMYEQSVLQCEQSVTSADKIAKELEVDAEKARAIKQRFENGEIYNDENRPPRNRDLDDRALFDDGIAKKSRSIFLELDANAKHASPGTPGTPTAEPPRRKEIPFVPKDVIRAADKLEDVKIETSDIHDRFQFFETYKPQVTRKEFRVTPPREPQRPKSPSPEIYHEPSVVRSDEPPADAGLAASRHTASKMIDVFRKMEQDRNRPDDCQGPKPLKRFTPPPPGESERHGNTSTSSEYSEDEDSDDEGLRRYQQAREQDEALRQAQQLARTKSFRDRFENWSESERDAEPRSPSADPRHLLDDGQSQLETAKSLREKFEMMKMQTTVTKSVTPKVNRFVVS-
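
The record's figures can be cross-checked against the two sequences above. The following is a minimal sketch, not part of the original record. It assumes the standard genetic code, assumes that the trailing "-" in the protein sequence marks the stop codon, and assumes that the InterPro coordinates [62-125] are 1-based and inclusive. The names `translate`, `N_TERMINUS` and `lim_domain` are illustrative only, and the sequence fragments are copied from the FASTA entries above.

```python
# Standard genetic code, built from the conventional T, C, A, G codon ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    usable = len(cds) - len(cds) % 3  # ignore any incomplete trailing codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# Lengths as stated in the record.
TRANSCRIPT_BP = 1935
PROTEIN_AA = 644

# 644 codons plus one stop codon account for the full transcript length.
length_consistent = 3 * (PROTEIN_AA + 1) == TRANSCRIPT_BP

# First 18 and last 9 nucleotides of DPOGS213639-TA.
first_residues = translate("ATGGCAACAACAATGGAG")
last_residues = translate("GTAAGCTGA")

# First 130 residues of DPOGS213639-PA, written in blocks of ten.
N_TERMINUS = (
    "MATTMEASQS" "LIQSESRVSY" "QEVHVTERTK" "KKSKTRRHKD" "EGSISVSKSS"
    "EKLFKKMKAA" "EGDNPTCAKC" "ARPVYAMERV" "KAERRSWHRD" "CFRCVQCDRQ"
    "LTVESYESDH" "TALYCKPHFK" "QLFEPKPVEH"
)

# Residues 62-125 (assumed 1-based, inclusive): the annotated LIM-type zinc finger.
lim_domain = N_TERMINUS[62 - 1:125]

print(length_consistent)
print(first_residues, last_residues)
print(lim_domain, len(lim_domain))
```

Under these assumptions the translated ends agree with the start (MATTME) and the end (VS followed by the stop marker) of the listed protein, and the extracted region carries the cysteine and histidine residues expected of a zinc-binding domain.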