Monarch geneset OGS2.0

DPOGS208659
TranscriptDPOGS208659-TA2829 bp
ProteinDPOGS208659-PA942 aa
Genomic positionDPSCF300281 + 196323-204850
RNAseq coverage376x (Rank: top 32%)
Annotation
HeliconiusHMEL0117480.058.57% 
BombyxBGIBMGA007756-TA1e-5945.23% 
Drosophilapita-PA4e-6337.78% 
EBI UniRef50UniRef50_D6X4L65e-9136.99%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6X4L6_TRICA
NCBI RefSeqXP_970032.25e-9236.99%PREDICTED: similar to zinc finger protein [Tribolium castaneum]
NCBI nr blastpgi|1892416949e-9136.99%PREDICTED: similar to zinc finger protein [Tribolium castaneum]
NCBI nr blastxgi|1892416944e-9333.44%PREDICTED: similar to zinc finger protein [Tribolium castaneum]
Group
Gene OntologyGO:00056342.7e-13nucleus
GO:00082702.7e-13zinc ion binding
GO:00036768.4e-08nucleic acid binding
KEGG pathway 
InterPro domain[13-88] IPR0129342.7e-13Zinc finger, AD-type
[224-261] IPR0130878.4e-08Zinc finger, C2H2-type/integrase, DNA-binding
Orthology groupMCL15868 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208659-TA
ATGGAAAAGAGAAAAGAATCGTTAAGAGCTGCAAAAGTGTGCCGATTTTGCCTATCACAAGATCAATCATTGGACAATTTGTATGACAGAAATAGAGCGCCAAAGAATGCTATTAACTTACATCTTAAAATATTATCATGTGTTGCAATCGAGGTATTTCCCTCAGACAAAATGCCTGCTTACATTTGCCATCGTTGCAAAACTTTTATGACCTTGTTCTATGACTTCAAACAAATAGTACGGAGAGCTGATGAATCAATCTTGCAATTTTTACAAAACGGTACACCTATGGAGACAATTTTATGGCCCTCTTCCTTGGCTAAAATCATACCAAGTCCTAATGAAATAAATGTTAAAACCATAGTAGAAGATGGTACAACTATCCAAGTTTCATCCCAAGATATCTCTGATAGCGATGAAGAGGACGGGAATGTGTATAATGTTAAGATAGGGGATGGACCGGATGATTCGAATACAACTTGCATTAAAGTTGTCACAAGCAAGGAAGAAGTGAAAGAGGATTCCGTACAAAGTGACCGCGAGGTGTGCTGGCCGTGTGACGAGTGCGACTGCACCTACCCTCTCCAACAACTCCTGGCTTTACACAAGCGACAGAAACACAGGCCACGTACAGTTGTATGTGATAAATGTGACGCGAAATTCTTCTCAAAGTATGATCTTTCTACTCATCTCCTCCGACACACCGACGAGACGCCGTTCCAATGCGTGGCGTGTGATAAGAAGTTCAAACGGCTGATCTTACTCAAACGGCATGAAAAGATAATTCACGCAGATTTGCCCCAGCAAGTCTGTCCAAACTGTCCAGCGACGTTTCTGTCCATCGAAGAGTTGGAAGCGCATAAAAAAAGGCACATCCACATAGAGAGACCGTATTCTTGTAACGAATGCGACAAAAAGTTTCACGAAAAGGCGACGCTCCAGCGACACATACAAGTGGTTCACAACAGGGAGCCCACTTACTGTTGTGGGTATTGTCCAGAGCGTTTCGTGTCGATGTCCAAGCTGACCAAGCACGTGCGCTCCCACGCCGGCGACCGACCCTACCCCTGCAAGTTCTGCGATAAAAGTTTTACTAAATCCCATCACTACACCAGACATCTCCGTGTGAAGCATCGTTCCCGTCACACCGAACAGTACCGCTGCGAGCAGTGCGACGACACCTTTACGAGTCAAGACGACCTCATCTATCACTCGGCTATCCACGCCACACAGAATCTCATCTGCCCGCTCTGCCAGGAGAAATTTGACAACGTCGATGACGTCACCACCCATATCAAGTCGCACGTCAGCGGCGTGGAGTTCGAGTGTGATTTCTGTGAGCTGGTGTTCACATCCAAGGAGAAACTTGACAATCACTTGATTACTGCGCACGAGGATGAACTGCAGCAGGTGGAATTGGATGAAATGGAAGAATTGGAAGAATCGTCTATAGAAGAAATAGAAGAGGATAACGGAATGAACTTGAAAGACGAAGGCGATCATATGGTTATTGAAATCAACAAAGAAGATTACATGATAAACAAAACCTCGGAATCCGACGTAAAGGTTGTTAACGCACAATCTGAAGATAGCGAATCAGAGAACACGTACACGGAACTAGCTACGGTGGATACATTGGCTGTCCTGAAGAAAAATGATGCGGCTAAAACAGTTGCCGAGAAGTCTGAACCGAAGGCTGGCGAACAAAAACCAGTCACAAATACTAAATCAGAAGTACAAAAAGCATCTCCAAAGAATGTTGAGATGTTGAGGACTAAACGACCGCAACTACAGACATCCACACCCAAAAGATCGGCCGAGGAAAAGAAATTCACTCTAGTGAACAAAACTCCGAATGTCGACAAGAAGCCGGAACGCAGGATAACCAAAGAAAACAAAGAGCCAAAAGATGTGAAGGAAACAAAGAACAATAATGGAAATAAAGATGACAAGGAATCACCAAAGAGCGTCATAAAGAATGGTTCTAATACTGACAAAAACGCCTCCGACGATGGAATCCGGCGGTCAACACGACCCTCTAAGATAAAGGATTACGCGAAAATGGTGCGAGACAAGTCACAGACATCCTCCGACCTGGATGAGAGTAGTGACGAAGACGAGGAGTATAGAGAGAGTGATAGGAGTATTGAAAGCCGTACTAAGATACGAAGAATTAATCCTGTCAAAACAAAGCCTGCAGATGTATCCCCTATACCAGCAACAGCTCCTCGAAAGAGAGGACGACCAAGGAAGGAATCTAAGACAACCAAAGATGTTCCTGCTAAAGTGAGAAAGGATGACGGTGAAGCAGCTGATACTGAAAAAGTAGAAGAACCGTCAGCAAAAATTAAAAGTGCAGTTAAAGATGAGAAAAAAGTCGTGGACACTGAACCGAACGAAACTACACAAAATACACCAAAAACACCTCCCGAACAAAAACCTGCTGCTGGAGATTTACTTGTATCTCCATCTGGACAAACACTGAAAAAGGTCCCAATCAAAGCATTACCTCCAGGAATAAAACCTCTACCTCTACCAGTGAACGCTAGGCCTGTAGCGTCCGGAGAACTTTGTGAAATGCAAATTGGCAAGAAAGTTGTGAAAGTACAAAAAATCGTAATGACGAAAGCGGAAGTAGAAGCTATGGCTAAAAAAGGACTCGTTGAAATGAAGGACGGTACTATGGTATTAAAACAAGGAATAAAACTGCCCAGCATGGAAAATTCTACTGCTAAAACATCCATAGCAGACAGTGACGCAGCAAAGGAATCTCTGGTGAAGAAAGAAAAAGCGGTGCCTACACGCTGCGACCTGGGTGATGATTCTTAA

Protein sequence:

>DPOGS208659-PA
MEKRKESLRAAKVCRFCLSQDQSLDNLYDRNRAPKNAINLHLKILSCVAIEVFPSDKMPAYICHRCKTFMTLFYDFKQIVRRADESILQFLQNGTPMETILWPSSLAKIIPSPNEINVKTIVEDGTTIQVSSQDISDSDEEDGNVYNVKIGDGPDDSNTTCIKVVTSKEEVKEDSVQSDREVCWPCDECDCTYPLQQLLALHKRQKHRPRTVVCDKCDAKFFSKYDLSTHLLRHTDETPFQCVACDKKFKRLILLKRHEKIIHADLPQQVCPNCPATFLSIEELEAHKKRHIHIERPYSCNECDKKFHEKATLQRHIQVVHNREPTYCCGYCPERFVSMSKLTKHVRSHAGDRPYPCKFCDKSFTKSHHYTRHLRVKHRSRHTEQYRCEQCDDTFTSQDDLIYHSAIHATQNLICPLCQEKFDNVDDVTTHIKSHVSGVEFECDFCELVFTSKEKLDNHLITAHEDELQQVELDEMEELEESSIEEIEEDNGMNLKDEGDHMVIEINKEDYMINKTSESDVKVVNAQSEDSESENTYTELATVDTLAVLKKNDAAKTVAEKSEPKAGEQKPVTNTKSEVQKASPKNVEMLRTKRPQLQTSTPKRSAEEKKFTLVNKTPNVDKKPERRITKENKEPKDVKETKNNNGNKDDKESPKSVIKNGSNTDKNASDDGIRRSTRPSKIKDYAKMVRDKSQTSSDLDESSDEDEEYRESDRSIESRTKIRRINPVKTKPADVSPIPATAPRKRGRPRKESKTTKDVPAKVRKDDGEAADTEKVEEPSAKIKSAVKDEKKVVDTEPNETTQNTPKTPPEQKPAAGDLLVSPSGQTLKKVPIKALPPGIKPLPLPVNARPVASGELCEMQIGKKVVKVQKIVMTKAEVEAMAKKGLVEMKDGTMVLKQGIKLPSMENSTAKTSIADSDAAKESLVKKEKAVPTRCDLGDDS-