Monarch geneset OGS2.0

DPOGS213246
TranscriptDPOGS213246-TA1917 bp
ProteinDPOGS213246-PA638 aa
Genomic positionDPSCF300124 - 8520-12688
RNAseq coverage15x (Rank: top 82%)
Annotation
HeliconiusHMEL0117633e-9744.19% 
BombyxBGIBMGA009525-TA3e-7051.78% 
Drosophilaerm-PA3e-3848.91% 
EBI UniRef50UniRef50_UPI0002247C214e-9650.13%UPI0002247C21 related cluster n=1 Tax=unknown RepID=UPI0002247C21
NCBI RefSeqXP_396560.34e-9850.40%PREDICTED: similar to CG1402-PA [Apis mellifera]
NCBI nr blastpgi|3503995491e-9950.93%PREDICTED: hypothetical protein LOC100740813 [Bombus impatiens]
NCBI nr blastxgi|3838477692e-9746.30%PREDICTED: uncharacterized protein LOC100876672 [Megachile rotundata]
Group
Gene OntologyGO:00036769.6e-12nucleic acid binding
KEGG pathway 
InterPro domain[259-285] IPR0130879.6e-12Zinc finger, C2H2-type/integrase, DNA-binding
Orthology groupMCL18882 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213246-TA
ATGTTTTCTGACGAGGAAATAAAAGAAAACAGCCTCATGCGGCCGGGAGATGAGATAATAAGGTCCGATCTTCGTTTATTTTTTACAAAAGTTACGCAAGGATTTCTATCTTTTGTTACATTCTATTATGTTTTTCCAATGTTTTATTGGAGTTTTTTGTATTTCAGTATACGGGAGGTATGCCTCGTCGACAAGTACGATAAAAAATATTTGGGCGGTGAAACATTGCTTTCTACTTCAGGTCGAAATGATTTGGAGGTTCCTAGGTACCTCCGTGACAGTCTCTGTGAGGGTCTCGAGGCGAGATTTGCGTGGACCGAGGAAGATGGGAATATCGCTTCCTTAGTTTCTGAACCAGGATTTGGAAGCGGAACCTTTGGCAAGAGCCTTGATGCGACTTTGACTAGTGAAATTAAGGAAGCTGACACCAGTTTTCCATCTACCAGCAACACGTTTGATGGAAAGTCTTTTGTGGAACCATCTTTAGTGTCTTCTACGTCTTCCGGATCCAGTCTGAACGCCTCGTCGGAACGTTCGAAACGCTCATACAAAGTTGACAAAACCGACCCACGTTCAAGATATCACTACGTAAAATACGTTAAAAGACACGGGCGCACCGTCAAACTCTGGGAGTGCGGAATATGTTCTAGGGAGTTTCAACATCAGTATACCTTAATGCGTCATCTCCCCACACACACGGACGAAAGGAATTTCCACTGTGACGCCTGCGCTAAGAGCTTCCGTCAGCTGTCCACGCTCAGCCAACACAGAGCCATACACTCCGCTGAAAGACCATACGCCTGTGAGGTATGCAACAAGACTTTCAATCGAGTATCAACCCTGATCTCGCACCGCAAAACCCATTCCAACGAGAAACCATATAGATGCCACATTTGTCCAAAAGGCTTCCATCAAAAAGGAAATCTCCGTAATCATTTATTCACTCATACCAATGAGCGACCCTACCGATGCAATATTTGCATGAAGGGCTTCAATCAGCAGTCGAATCTGGTGTGCCACAAAAATAAGGCCCATCCAGAAGAAAATGGCAGCAGTAATGGAAGAAATGTGAATCAGCCGCGAAGAGTTACACAACCTCAACCAGAACCACAGACGGACAATAGATCATCACAGCAGTGTGAAGTGACGTCATCAGTAGGTCCGTACATACCAGAACCGAAGTCGTGGTCAAATTCGAAACCAAGCTGGTTGTCGAAACCTGATAATGATATATGGAATGAGGTTTCGTGGGGTAACAATGGGGTTATCGTGGATCCTATAAACACTTATCACATGGGGGTTGCCATAGCAACCAGACAGACTCCTTTCGCGCTACTAAAGTCTGATACGGGAACTCCTGTGTTGGTGAAAGTAGTCGATACAAAGCTTCCCGGCGGCAAACAGATGCTAGTACCGGCTACGGCAGAGGATTTGCGTGTTGGTAGTAAAATAATTTTGGACAATCAAGAAAGTCCTGCAGTGGATGTCCAATCTTCCGACGCTAACGCGGTTCAGATCAGGGTGCCGGTTGTGGCGACTGTGGTCCCTAAAATGAAACCGGGTGGTAGACTCCAGTTATCAGTAGAAGAACCCCATCATTCATACCATTCAGCTCTACCGACTGACATTGGTGAAGTGAAGGTAGAGCCTTGTACAAGTCCGGCATCGAACCCGCTGCCTGATGCCAAACAAGTCGATGGAACATTCCAACCGAGCATAAAACCTGGTCGCTCATGGATCACACCGGCCTCCCCACCACTGGACCTCATCCCCCTAGACCTGTTTGAGCCCATGGGGTGTATACCACTAGGGCCCCAAATAACATCAGTAGATATCGACCAGCCCCCACATTCTGATGATTCTGACATATTTATAGGAAAGTTCGAGGAAAGTATCCCTTTAACTGATTCTGACTGA

Protein sequence:

>DPOGS213246-PA
MFSDEEIKENSLMRPGDEIIRSDLRLFFTKVTQGFLSFVTFYYVFPMFYWSFLYFSIREVCLVDKYDKKYLGGETLLSTSGRNDLEVPRYLRDSLCEGLEARFAWTEEDGNIASLVSEPGFGSGTFGKSLDATLTSEIKEADTSFPSTSNTFDGKSFVEPSLVSSTSSGSSLNASSERSKRSYKVDKTDPRSRYHYVKYVKRHGRTVKLWECGICSREFQHQYTLMRHLPTHTDERNFHCDACAKSFRQLSTLSQHRAIHSAERPYACEVCNKTFNRVSTLISHRKTHSNEKPYRCHICPKGFHQKGNLRNHLFTHTNERPYRCNICMKGFNQQSNLVCHKNKAHPEENGSSNGRNVNQPRRVTQPQPEPQTDNRSSQQCEVTSSVGPYIPEPKSWSNSKPSWLSKPDNDIWNEVSWGNNGVIVDPINTYHMGVAIATRQTPFALLKSDTGTPVLVKVVDTKLPGGKQMLVPATAEDLRVGSKIILDNQESPAVDVQSSDANAVQIRVPVVATVVPKMKPGGRLQLSVEEPHHSYHSALPTDIGEVKVEPCTSPASNPLPDAKQVDGTFQPSIKPGRSWITPASPPLDLIPLDLFEPMGCIPLGPQITSVDIDQPPHSDDSDIFIGKFEESIPLTDSD-