Monarch geneset OGS2.0

DPOGS216199
Transcript: DPOGS216199-TA  4473 bp
Protein: DPOGS216199-PA  1490 aa
Genomic position: DPSCF300080  +  220035-227025
RNAseq coverage: 141x (Rank: top 55%)
Annotation
Heliconius: HMEL016327  0.0  80.09%
Bombyx: BGIBMGA004545-TA  0.0  71.38%
Drosophila: ci-PA  5e-125  45.50%
EBI UniRef50: UniRef50_G6DJF4  0.0  99.20%  Putative uncharacterized protein n=25 Tax=Obtectomera RepID=G6DJF4_DANPL
NCBI RefSeq: XP_970110.2  8e-144  43.54%  PREDICTED: similar to zinc finger protein [Tribolium castaneum]
NCBI nr blastp: gi|4176760  0.0  92.60%  cubitus interruptus [Junonia coenia]
NCBI nr blastx: gi|4176760  0.0  94.75%  cubitus interruptus [Junonia coenia]
Group
Gene Ontology: GO:0003676  3.5e-15  nucleic acid binding
KEGG pathway: tca:658653  2e-143
  K06230 (GLI) maps->
    Basal cell carcinoma
    Pathways in cancer
    Hedgehog signaling pathway
InterPro domain: [302-332]  IPR013087  3.5e-15  Zinc finger, C2H2-type/integrase, DNA-binding
Orthology group: MCL15776  Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS216199-TA
ATGTTATATACATATACTTTTCTTCTTCTAGGTTTGGAATATTTGAGTGCAGCTAGGAGTTTGCATCCGGAATTACATGCAGGGAGTACATTAGCCAGTCAAGATTTTCAACTTAGTCTGGAAGGATCGAGAATAGCATCTCCCAATCGTTTGAGGTTATCCGGCGGGGCAATAGGTGCGTCGGCAAATAGAAAACGAGCAGTTTCATGGAGTCCCTATTCTGCTGAGTCGTTAGACCTGGCAGCTGTCATCAGAGCATCACCTGCTAGCTTGGCTGTCAGAGCACCTTCAGCGGCTTCCACTGGCAGCTATGGACACCTTAGTGCTGGTGCAATATCTCCGGCGCTATCGTTGTCACATGCGTCTCTTGCTCAACAACTTTTGGCGAGAGGTGGTGTCGGTGGTAGCAGTGTGCTTTCTGGTGGTGTGCTGCTTGATCCTGCTCATCAACAGGCTGCGGCAGCTGCGGCACACCATGCTGCCCATGCGCATCTTGTCGCTGGGATCCACCGATCTCATATATCTTCACCGACTCAGCTCCTCATAGGTGGCCCAGTGGATGTTCGAAATGGTTTGGGTTTAGATGGAACGCCTCCTCATATGCAACAGCCACCACAGCAACCCGAAATCACTAGCGTCATGGAAGCGGATAGTGCATCAACAGCGTTAACGCAGCGCAAATCGCCCCAAGTATTGGTTTCTCATAGAGACAATATGCATGGCAATAAACCTTTATCAGCAGCTGCCGAAAGTACTGTACATGACGGGCTTGACTCTAAGGATGAGCCTGGGGATTTTATAGAGACAAATTGTCATTGGGTTGATTGTAAACTTGAGTTTCCAACACAAGACGATCTCGTAAAACATATTAACACAGATCACATCCATGCTAGTAAAAAAGCCTTTGTCTGTCGCTGGGTTGGGTGTTCCAGAGATGAAAAACCTTTTAAAGCTCAATATATGCTGGTAGTTCATATGAGAAGACATACAGGAGAAAAGCCCCATAAATGTACATTTGAGGGTTGCTGCAAAGCCTATTCGCGTTTGGAAAATTTAAAGACTCATCTGCGAAGTCATACCGGAGAGAAACCTTACACTTGCGAGTATCCTGGATGTGCGAAAGCTTTTTCCAATGCCAGTGACCGTGCTAAGCATCAGAACCGAACGCATAGTAATGAGAAACCATACGTGTGCAAAGCTCCAGGATGTACGAAAAGATACACAGATCCATCATCTCTCAGAAAACACGTGAAGACGGTACATGGCGCAGAATTTTATGCCAGTAAAAAACATAAAGGTTGTAGTCGTGGGGATGACTCAGCAGAATCTGGTGGAGGTTGTGCAGGTTCTTCACCGCGATCGGAAGAGGGAGGTGCTCTTATCAGAGGTCACACCTCTTCAGCCTCTGTCAAGAGCGAAAGTCCAGCTTCACCGCTACCGCTCATGCACACCGCCGCACATCAGCTGTCAGCACAGTGCGGTGGAGATTTGGACTTTGGTGGTTCAGGACTAGGTGGTTTTAGTGACGAACACGGAGCCCCATACTTCCGACTTGATGGTGATGTGGAACAGGAAGTGGTTGGTGAGGTTGGTCAGCTTCCGCTGATGCTTCGTGCGATGGTTGCTATTGGAGAACCACGACATGGACCAAGATTTGGTAACAAAATGGCCCTTGGAAGACTAATGCCTTCTGTACATGATATGGGTGCCGTTCAAGGAAGAACAGAATTAGGCAGCACTAACGTAGCGGTCGAACTGAAGACTGGACTCCCAAATACAAGACGGGATTCCGGAATTTCATCAGGAAGCAGCTTGTATAGTGCGAGATCTTCAGATATTTCACGCAAAAGTAGCCAAGCATCTGTAGTGTCAGGTGCAAGACTTGCCTCTCAACATACTAACGTATATGACCAACTATCACCTGATAGTAGCCGAAGGCAAGTATTTATATATTATCTCTATAAATCTAGTCAAGTTTCATGCGTAGGATATGCACCACC
ACCATCATCAGCTTTAGCAGCAGTACAAGCTGTAAGAACCTCGCAAGGCCATCAGGCTGTATTACTACGCGGTGTGACCTGTTCCGAGGTGAGAGCAGAAGAATTGGCTCTCGAATTAGATCCCAATGTGCAGGTTAAAGAAGAAGCTAGAAGACTTTCAGAGCAATCAAATCTAAGTGACCAAGCTCAAAGCTATCAACCTTATCCCTCTAATAATGATGACGTGGGTGAGACTCCATTCCCATTTAAAACAGAAGACGACCATGAAATAGCCTATAAGGAATCCCGTTCAAATTCGACTAACACCGTTGTTATTACTACAGCCCAAGTTCATCACCCAAATCAAGAAGTAAATCTAGAACAGGTTGCTGAAGGAGAAATGGTGGAGAACAAGCTGGTGATACCAGACGAAATGATGCAATATCTTAACCAATCAATATTGGGAACCGACTCGGTAACACCAGCCAAAACGGACTCTACCAACCTTAAACCTGAAGCAAGTAAAGACAATGACACAACCAAAGACATTTTAACCAGTGAAACACAAACAAACAATTCCAGTGACAAAATAAGTGATGTCGCTACTAGTGACGATTCTCTTCTAAAAAATCTAGGTGCCATAGGAAGTGATCTCAACATAAGTGATATTCAAGTCGATTTAAGGTCTTTAGACGTTAGTATGTCTGGTAATAGCGGCTCTCTTTTAGCTGCCAAAAGTCCAGATGAAAAGAATCCTCCACTTCAAGAACAGGTTATATCTGAAGTTGCAAATGAAAAATGCGAAGATTACTTAAAACCAACCGCTAATAATGCAGCTAGTAATCCTTTGCAGTCATTGCAAACGATGGCTGCCAATCAAACTGAGCAATCGAACAGAATGAGGATTAATAATTCTCTACCTCAAAAGCCAGCTATTAGTCCTAAGACTATTATTATGACACCTCAAACAGTTATGAGTCCTAATTTGGTCCATAGTATGTTAAGTCCACAGAGTCTACCACATAGTTCTATGAGTCCACAGAGTATAAGAAGTCCACAGCATATGCCACCAGGTATAATGAGCCCTCCAAGTATATATAATGTTATGAGCCCTCAGAGTGTAATGTCTGTTATGTCTCCGCAACATAATGCTATGAGTCCACAGAGTATGCCAAGTTTAATGAGTCCGCAAATGCCCAACCAAATGATGAGTCCGCGTAACAACAACATTGCAAGCCCTATTACGCAAAACATGGGAAGTCCAATGATGAATATAACCAGTCCTCTGAATCAAAACATAGCAAGTCCGATGAGCCATGGAATGCCTAGTCCAATGCATCCGGGTCTCCAAAGCCCTAATCCTATGGTACAAAACCTCTCTAATATGCCAAGAAATTCACAAGCGATTCAAAATCAAAATCAACACCACATGATGAATATGAATCAAACACAGCAATTGGCGCAACAACAAGCATATAATAATCATCAGAATTGTAAACTCCCAAGCAAAAATTACAACAATGTTCCTAACCAATACCAAAACCAAACATACACTCAAGCAGCAGCATATCCTACGCAAAATCAAATGAATATAAATATGAGACATCAAAATTTACAGCAATACCAAATGATGCAGCAATTTAATCAAAACCAGTCACAGATACCGATAATGCCTCCAAATAATTCGAATATGGCTTACGGCAACCAAATGATAAATTACGCTCAACATGGCAGTTATACAAACCAGAGTAATCAATTGCATCCGATGCAAATATCGAGGTCTTCTATGATGAGCGTCGACAATAGCGGCAATATGAACCGAGGGTCGTTAAATGGTTATTACGAACAGACAATGAGTCCGTCTGCACAGAATCCTCAGTACAATCAAAATGTACAATATCCAGCTCCACCACCATACAACGCGGTGAACAATGCAGCAAATGTAATGGGTCCGCCGCCACCTAAAAACAATCATCAATATAATCAAGCCATGATGAACAACAATCAATATTATAACAACC
AGAGATCTTACAACCAATGGGACTATCCGGGTAATCAGTTCAATAAACATAATATGCAAAAATCGACACAAAACTCAGTCAACATGTCAACTGGTAGTCAGAAACCTGTCATGAACGGTACAAGAATATCAATGAATTGCAACCAAATAATGAAAAATCCTGGTGAACAAACTGATTGTAGCATGAACAGCTTACGGAGTCAAAATAACCAGGCTGATGTTCAAGTGTGGGATATATCTCAATCTCAAATAGAGGCTACGAACGGTAGGAAGAAGAATCAGAATACTATGCGTCAAGAAACTTATCAACGGACTTTAGAATATGTTGAGAGTTGCGAAAACTGGAAGAGTTCTGAAATAGTTTCTAGCAGTACACATCCTCTACAAGGCGGAGACAATATGGTTGTCAACGACCTCAGGACTTCCTTATCTTCGTTTTATGAAGAGAATCAATATTTGCAAATGATTCAATAA

Protein sequence:

>DPOGS216199-PA
MLYTYTFLLLGLEYLSAARSLHPELHAGSTLASQDFQLSLEGSRIASPNRLRLSGGAIGASANRKRAVSWSPYSAESLDLAAVIRASPASLAVRAPSAASTGSYGHLSAGAISPALSLSHASLAQQLLARGGVGGSSVLSGGVLLDPAHQQAAAAAAHHAAHAHLVAGIHRSHISSPTQLLIGGPVDVRNGLGLDGTPPHMQQPPQQPEITSVMEADSASTALTQRKSPQVLVSHRDNMHGNKPLSAAAESTVHDGLDSKDEPGDFIETNCHWVDCKLEFPTQDDLVKHINTDHIHASKKAFVCRWVGCSRDEKPFKAQYMLVVHMRRHTGEKPHKCTFEGCCKAYSRLENLKTHLRSHTGEKPYTCEYPGCAKAFSNASDRAKHQNRTHSNEKPYVCKAPGCTKRYTDPSSLRKHVKTVHGAEFYASKKHKGCSRGDDSAESGGGCAGSSPRSEEGGALIRGHTSSASVKSESPASPLPLMHTAAHQLSAQCGGDLDFGGSGLGGFSDEHGAPYFRLDGDVEQEVVGEVGQLPLMLRAMVAIGEPRHGPRFGNKMALGRLMPSVHDMGAVQGRTELGSTNVAVELKTGLPNTRRDSGISSGSSLYSARSSDISRKSSQASVVSGARLASQHTNVYDQLSPDSSRRQVFIYYLYKSSQVSCVGYAPPPSSALAAVQAVRTSQGHQAVLLRGVTCSEVRAEELALELDPNVQVKEEARRLSEQSNLSDQAQSYQPYPSNNDDVGETPFPFKTEDDHEIAYKESRSNSTNTVVITTAQVHHPNQEVNLEQVAEGEMVENKLVIPDEMMQYLNQSILGTDSVTPAKTDSTNLKPEASKDNDTTKDILTSETQTNNSSDKISDVATSDDSLLKNLGAIGSDLNISDIQVDLRSLDVSMSGNSGSLLAAKSPDEKNPPLQEQVISEVANEKCEDYLKPTANNAASNPLQSLQTMAANQTEQSNRMRINNSLPQKPAISPKTIIMTPQTVMSPNLVHSMLSPQSLPHSSMSPQSIRSPQHMPPGIMSPPSIYNVMSPQSVMSVMSPQHNAMSPQSMPSLMSPQMPNQMMSPRNNNIASPITQNMGSPMMNITSPLNQNIASPMSHGMPSPMHPGLQSPNPMVQNLSNMPRNSQAIQNQNQHHMMNMNQTQQLAQQQAYNNHQNCKLPSKNYNNVPNQYQNQTYTQAAAYPTQNQMNINMRHQNLQQYQMMQQFNQNQSQIPIMPPNNSNMAYGNQMINYAQHGSYTNQSNQLHPMQISRSSMMSVDNSGNMNRGSLNGYYEQTMSPSAQNPQYNQNVQYPAPPPYNAVNNAANVMGPPPPKNNHQYNQAMMNNNQYYNNQRSYNQWDYPGNQFNKHNMQKSTQNSVNMSTGSQKPVMNGTRISMNCNQIMKNPGEQTDCSMNSLRSQNNQADVQVWDISQSQIEATNGRKKNQNTMRQETYQRTLEYVESCENWKSSEIVSSSTHPLQGGDNMVVNDLRTSLSSFYEENQYLQMIQ-
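
Consistency check (illustrative sketch, not part of the database record): the listed transcript length (4473 bp) and protein length (1490 aa) agree if the transcript is assumed to be pure coding sequence ending in a stop codon. The Python below checks that arithmetic and translates the first 18 and last 12 nucleotides of DPOGS216199-TA with the standard genetic code. The function names are hypothetical, and rendering the stop codon as "-" is an assumption taken from the trailing "-" of the protein sequence above.

```python
# Standard genetic code, built from the conventional TCAG codon ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon (hypothetical helper)."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = [CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3)]
    # The record writes the stop codon as "-", so "*" is replaced here.
    return "".join(stop_symbol if r == "*" else r for r in residues)


def expected_protein_length(transcript_bp, includes_stop=True):
    """Amino acids encoded by a transcript assumed to contain no UTR."""
    codons, remainder = divmod(transcript_bp, 3)
    if remainder:
        raise ValueError("transcript length is not a multiple of 3")
    return codons - 1 if includes_stop else codons


print(expected_protein_length(4473))    # 1490, matching the Protein field
print(translate("ATGTTATATACATATACT"))  # MLYTYT, start of DPOGS216199-PA
print(translate("ATGATTCAATAA"))        # MIQ-, end of DPOGS216199-PA
```

To check the whole record, the same translate function could be applied to the full nucleotide sequence after joining its lines, and the result compared with the protein sequence.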