Monarch geneset OGS2.0

DPOGS208268
TranscriptDPOGS208268-TA1557 bp
ProteinDPOGS208268-PA518 aa
Genomic positionDPSCF300079 - 106063-114129
RNAseq coverage392x (Rank: top 31%)
Annotation
HeliconiusHMEL0021328e-12769.25% 
BombyxBGIBMGA006442-TA3e-3838.94% 
Drosophila% 
EBI UniRef50%
NCBI RefSeqXP_002000769.15e-0820.85%GI10409 [Drosophila mojavensis]
NCBI nr blastpgi|1951124161e-0620.85%GI10409 [Drosophila mojavensis]
NCBI nr blastxgi|1951124165e-0920.73%GI10409 [Drosophila mojavensis]
Group
Gene OntologyGO:00036771.3e-10DNA binding
KEGG pathway 
InterPro domain[242-289] IPR0036561.3e-10Zinc finger, BED-type predicted
[445-502] IPR0075881.8e-07Zinc finger, FLYWCH-type
Orthology groupMCL25246 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208268-TA
ATGTACCAAAACCACACATTCAGATTGATGTACAAGAAAGCATACGGCAGTCAGCTTTGGTACTGCTGGTACTTTTTCAGCAGATGCTGCCCGGCAAAGTTGAGGATAAATAACGCTGGAAGGGTCACAGAAACAATAGGGAAACATAACCACGACCCTCCAACTTGTTGGATGACACCCGATGGTGACGAGTTAGTGCAATGTGTTCCGACGAAGAAAGGTCATCTTTTGCTGTACAAAGGCTACACTTATGTTTTTAAAAGTAGATTACAGTGCGGAGCGGAGCAGTGGTATTGCTCATACCGTCAGAAAACGGGTTGTATATCTGTTATCAATATGCAAACATATGGCGTACAGGTGGAATGGATATCAATATATTTCGTTAAGAATGAAAATGAATACCGTTGTAATATATGCGAGGAGAACATCAAATGCGATGAAAAGTCACCGGAACTCCTGAAACATATAAAGTGCATGCACAGAGAAGTTTACGATCTCCATAAAAGCAATCCCGAATCGACTGTGGGTTTCCAAATAGAATTCTTGCAGTTTAACATGTTGAAGGACGACGATGGGAAGTCCCAGCCGGTTATATTGGAGGAGGTGTGCGAGTCCGATGAAATTGTTAAGTGTGAAGAAAAGACAGATAAAGATTTGATTAGCAACATGCAGTACGTGCTAGTGCCTAGGAAAAAGAAATTAAAAAAGAGTTATGAGTTTAGTCGTAAGCGGAGTTGGGTGTGGAAGTATTTTGATAAGTTGACCAACATTATATATAGATGCAACCTCTGCAACGTGGTACTATCCATCAAGGGTTGCAACACCAACAACATGAACCGCCACGTCAGAACCAGACACCCTGGTGTCTATAAATCTGAAGTGGAAAAGAAAAAAGATTCACAGGAATCGGAAAATCTTGATATGACTTGGAAGAAAGAGATAGATAGTGACAAGGAACTAGATGAGACTATTGACACCAAGCACGGTCGTAGCTGGATCTGGTCGTACTTCCAGCGCGTGACCAGCACACTCGCTCAGTGCAAGCTGTGCAAGAGGAACATCTGTCACGGTGGTAACGCCACCGGCAACATGAACAGGCATCTCAAAATGATACATCACAAGACCGCAGATGACAACAACTGGGTATGGAAAGTGTTTGAAAACACGGAGGAGCATTTTTTCTCTTGTAAAATTTGTAACTTTAAATGTATGAAGTTTGATGAGGTGGATAAGAGTATTAGGTGTATCTTGCAACACTTGAAAAGCGAGCACGGCGTCATATCCGGGGACCAAATCATAACGGGAACGGAATACGAGATGGAGACGGAAATAGTGACTATGTTTAACGGGAAGAAGGTTATACTACACAACGGAAATACTTATTACAAGAAGAATAAATCTGGCGTTTACATGAGATGGGCGTGTACGGGACACGCGAGCTGTAAAGCCTATCTGAAAGTAGACAAAGACCTCGTCATACGAGATCTTAACGAGAAGCACTCGCATGTGATAAGGAAACTCGTGAAAACGTCCACGGGAAGGTACATCCGGTTGTAG

Protein sequence:

>DPOGS208268-PA
MYQNHTFRLMYKKAYGSQLWYCWYFFSRCCPAKLRINNAGRVTETIGKHNHDPPTCWMTPDGDELVQCVPTKKGHLLLYKGYTYVFKSRLQCGAEQWYCSYRQKTGCISVINMQTYGVQVEWISIYFVKNENEYRCNICEENIKCDEKSPELLKHIKCMHREVYDLHKSNPESTVGFQIEFLQFNMLKDDDGKSQPVILEEVCESDEIVKCEEKTDKDLISNMQYVLVPRKKKLKKSYEFSRKRSWVWKYFDKLTNIIYRCNLCNVVLSIKGCNTNNMNRHVRTRHPGVYKSEVEKKKDSQESENLDMTWKKEIDSDKELDETIDTKHGRSWIWSYFQRVTSTLAQCKLCKRNICHGGNATGNMNRHLKMIHHKTADDNNWVWKVFENTEEHFFSCKICNFKCMKFDEVDKSIRCILQHLKSEHGVISGDQIITGTEYEMETEIVTMFNGKKVILHNGNTYYKKNKSGVYMRWACTGHASCKAYLKVDKDLVIRDLNEKHSHVIRKLVKTSTGRYIRL-