Monarch geneset OGS2.0

DPOGS207860
Transcript: DPOGS207860-TA; 3105 bp
Protein: DPOGS207860-PA; 1034 aa
Genomic position: DPSCF300042 + 1750574-1756160
RNAseq coverage: 131x (Rank: top 56%)
Annotation
Heliconius: HMEL008432; 0.0; 60.78%
Bombyx: BGIBMGA009945-TA; 0.0; 47.62%
Drosophila: crol-PE; 2e-44; 27.53%
EBI UniRef50: UniRef50_UPI0001CBAEE4; 3e-54; 24.02%; UPI0001CBAEE4 related cluster n=1 Tax=unknown RepID=UPI0001CBAEE4
NCBI RefSeq: XP_002738213.1; 5e-55; 24.02%; PREDICTED: zinc finger protein 107-like [Saccoglossus kowalevskii]
NCBI nr blastp: gi|291236572; 1e-53; 24.02%; PREDICTED: zinc finger protein 107-like [Saccoglossus kowalevskii]
NCBI nr blastx: gi|291236572; 2e-67; 24.58%; PREDICTED: zinc finger protein 107-like [Saccoglossus kowalevskii]
Group
Gene Ontology: GO:0003676; 5.8e-08; nucleic acid binding
KEGG pathway:
InterPro domain: [782-808] IPR013087; 5.8e-08; Zinc finger, C2H2-type/integrase, DNA-binding
Orthology group: MCL19580; Insect specific

Nucleotide sequence:

>DPOGS207860-TA
ATGTCATCCGATGAATCAGATGATGAATCCCTAGCATTCTTAGCGGCCTCAAAAAGAATTAAGGTGGAAGAAGATGAATTGTTAAATAAAGGAAATGAAGACCCTTCCACAAAAAAGGTTAAAGCCACAAAGAAAACTGATCGTAAACTTAACGTTGGCGCTCCAAATGTTATAGAAAGACCTGCGGATGTATGGCTGTACCTTAAAGATTTGAAACCCTCAGGCCCTTACAGCTGCTTACTGTGTGATGATTGGTTTATTAATCGATCTAAAATGATTTTGCATTACGCAGTAAATCACAAAAAAGATTTTTGTGGTATATGCAGATACTTCGTACCAAATAGACAGGCCTGGTATGCACATGAGAAATTTCATTCACCCTGGCCATGTTCACAATGTGTAGAAACTTTTACCTCGGAGCTAATGTTGAGAGAACATCTTAATTCTGCACATAATCTGGTCCACTGTAGATTGTGCCACTTCAGAGTGTCTGCTGATTTCAATTACAACTCACATTTATTTGAAAAGCACAATGTCACTAATGTATCTTCCAAAAATGAGGATGTTTTATGGAAAGTAGAAGGTGGTACGTTTCAATGTTTACTCTGCTCTAAATCAGAAAATACACTATCAACTTTCTTTGGACACTTTATGGGTATTCATCATTTAACTTTAAAGTGCCTCACATCAGTTATAGCCGGCAGAGACACACCTTTCACAGTGAAAGGGGCTGATGTTAGTGAAAAATTTATTAATGAGCAACTCAAAAGCCATGTTCGATTAGGTTATGTAGACTGGGAAACAAAAGATGATAAAACAATGAATGAATTCAAAAAAGAAAAGGATTTGGAAAATAGTTTGACCAAAGTAGAACAAAGTGCATCTGTGAGAGAAATAAAAGAAGAAGTAATTAGTGATGAAGAAGAATTAGTTGAGAAAGAAAATGAAGTTAATCAGAAAGATGAGCCTTCAAGCAGTTATTATAAATGGGCTGAGGATTTTGACATTACATACATGGAAATTATAATAGTCCATAAATCATATTATGACTATGTCGACACGTCCCTCAGGGATATCAATTCAAATTTAATGCCAGAGAAATCGTATTTGAATTACGAAAGAATGAAAGCAGAAATATATATGGACGTTGAATGCGGATTTTGTAAAACGAACTTTGACACAGCACAGTCGTTTGTTGAACATATGAATAAAATTCACAGTGTTAAATCAGTACCCTTATATTCTTGTAGAGTGTGTTGCGAGACGTTTGATAATTATTTAGATTTATGTACACATGTTACCGAGGAGCTGGCAGACTTTGAGGATCTTTGGATTTGTCAGTTTTGTGACAAGGAGTTTGATAACCGTGAAGAGACAAGACATCATCTGACCGAGCATTGGACTGCTTTGGACTATGATAACTGTTTTAGTCCGCATTTAGGTTTTAAATGCAAATACTGTCCGACATTATTCTGGAACGAACCGGACAGAGAGACTCATCAACTTAGAGTGCATTTAGATAAATATAAACATCAGTTCTACAAGTGCGAAAAATGTGATATGGAATTCGGCGATAAGGTCTGGTATGTATACCATCATTTAGAAAACCATCAAAACCCAAATGCAGTAACTAACTATATCCTAAAGTGTAACATCTGTTGTTCGGTGATGGCAACCATTGAGGAGATGAGGAATCACTTTGCAAGAAACCATCTCGAGTTCAAGAAAGTCTACTGTAACATAGATCCTTGCTGTTATAAGCCATTGAATCACCAGCGGTCTCTAAAAATACATATTAAGATGGCGCACAGGATAACAGATTTACCCAAAACTCCGAGAGTGCCAAAAACTAAAAAGAAGGTGTCATGCAACATGTGCAATCGTAAGTTCAACAACGCTCGTGCTTGCAGCACACACATGGCACAGGTGCATGGACCTGGGAAGTTCAAATGCAAACTGTGCCGCGAGGTGCTGCAGACTGCTGATGAAAGGAAGCTCCA
CTACCTCCTGTGCCACCCTGGTCGGCATCCATTCGAGTGTACTGAGTGCGGAAAATCATTTCAATACAAATCATCACTGTACATGCACAAACAGGAACACATGCCGAATAAACAGAGCTACACCTGCAGTTACTGCAGTAAGGTTTTCGCGAAGAAGGATTCGTATCGTGAACACGTCCAGATACACGAAGGTCCTCGCCACGCGTGCTCGTACTGTCCGATGAGGTTCGTCCAACGTTCCAACATGTTGAGACACGAACGACGGCACACAGGCGAGAGACCTTACAGGTGTCCTCATTGTACGAGGACCTTCGCTGATAAAGGGGCCTGCACTTCACATGCTAGGACACATTCGAAAGACTCGTCCTATGCCTGCGTGTACTGCGGTCAGACGTTCGTACAGAAGTCGAAACTCACGTACCATATCAGGAAACACACGGGAGAAAATTTGGAGTCGTGTTCCGTCTGTTCGAAGCTGTTCACCAGCGCGTGCTCGCTGCGGGAACACATGAAAATACACGTGGAGAAGAAGAAGATCGTCAAGTGTCCTCTATGCGACAAGGGCTATCAGGACGAGCGTTATATGCTGCGTCACCTCCGCACGCTACATTCCCGTTCACAGTTCTCATGTCCGTTGTGCCACAAGCTCCTCTCCAGCGCTGCAGGTCTCCGTCACCACGTCATAACACACAGCTGCGTCAACACTTTCCAGTGTAAATCCTGCACAAAATCCTACGCAGTGAAAAGGACCATGTTGAAGCATTTAAGGAAGCGGCACGGCTTAACGGGCAACGAGTTAAATATAAAGGATTACTACACTAGATTAGAGCCACGCGAGTGTCAATTGGATCTAGACGAGACAACGATGACCAGTATATTCGGACCTCCCAAGAAGAAATCGACGGACATATTGTTCGGGGATTTCGTAACTTTGGCTAAGAAAATCAATGGACCAGAAGAAAAGAAACGAGATGGAGATAGTAGTAGCGATGAGCCGGTTACAAGGATTAAGCAAGAGGTCCAGAACCAAACTGAAATAGAAATAGAACCAACAGATTTCGTCAGTGTTAAGATTGAAAGTGTGGACGCTGGTTATACAGAGTGA

Protein sequence:

>DPOGS207860-PA
MSSDESDDESLAFLAASKRIKVEEDELLNKGNEDPSTKKVKATKKTDRKLNVGAPNVIERPADVWLYLKDLKPSGPYSCLLCDDWFINRSKMILHYAVNHKKDFCGICRYFVPNRQAWYAHEKFHSPWPCSQCVETFTSELMLREHLNSAHNLVHCRLCHFRVSADFNYNSHLFEKHNVTNVSSKNEDVLWKVEGGTFQCLLCSKSENTLSTFFGHFMGIHHLTLKCLTSVIAGRDTPFTVKGADVSEKFINEQLKSHVRLGYVDWETKDDKTMNEFKKEKDLENSLTKVEQSASVREIKEEVISDEEELVEKENEVNQKDEPSSSYYKWAEDFDITYMEIIIVHKSYYDYVDTSLRDINSNLMPEKSYLNYERMKAEIYMDVECGFCKTNFDTAQSFVEHMNKIHSVKSVPLYSCRVCCETFDNYLDLCTHVTEELADFEDLWICQFCDKEFDNREETRHHLTEHWTALDYDNCFSPHLGFKCKYCPTLFWNEPDRETHQLRVHLDKYKHQFYKCEKCDMEFGDKVWYVYHHLENHQNPNAVTNYILKCNICCSVMATIEEMRNHFARNHLEFKKVYCNIDPCCYKPLNHQRSLKIHIKMAHRITDLPKTPRVPKTKKKVSCNMCNRKFNNARACSTHMAQVHGPGKFKCKLCREVLQTADERKLHYLLCHPGRHPFECTECGKSFQYKSSLYMHKQEHMPNKQSYTCSYCSKVFAKKDSYREHVQIHEGPRHACSYCPMRFVQRSNMLRHERRHTGERPYRCPHCTRTFADKGACTSHARTHSKDSSYACVYCGQTFVQKSKLTYHIRKHTGENLESCSVCSKLFTSACSLREHMKIHVEKKKIVKCPLCDKGYQDERYMLRHLRTLHSRSQFSCPLCHKLLSSAAGLRHHVITHSCVNTFQCKSCTKSYAVKRTMLKHLRKRHGLTGNELNIKDYYTRLEPRECQLDLDETTMTSIFGPPKKKSTDILFGDFVTLAKKINGPEEKKRDGDSSSDEPVTRIKQEVQNQTEIEIEPTDFVSVKIESVDAGYTE-
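
The transcript and protein lengths listed above can be cross-checked against each other: 3105 bp is 1035 codons, which matches 1034 amino acids plus one stop codon. The following is a minimal Python sketch of that check and of a codon-by-codon translation. It assumes the standard genetic code (NCBI translation table 1) applies to this transcript and that the trailing `-` in the protein record marks the stop codon. The function names `translate` and `lengths_consistent` are illustrative and are not part of the database or any library.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered over T, C, A, G.
# Assumption: this table is appropriate for the nuclear transcript shown above.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as stop_symbol ("-" to match the record above).
    Any trailing bases that do not fill a whole codon are ignored.
    """
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    protein = "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))
    return protein.replace("*", stop_symbol)


def lengths_consistent(transcript_bp, protein_aa):
    """True if the transcript is exactly protein_aa codons plus one stop codon."""
    return transcript_bp == 3 * (protein_aa + 1)


if __name__ == "__main__":
    # Lengths as listed in the record: 3105 bp transcript, 1034 aa protein.
    print(lengths_consistent(3105, 1034))
    # First ten codons and last five codons of DPOGS207860-TA.
    print(translate("ATGTCATCCGATGAATCAGATGATGAATCC"))
    print(translate("GGTTATACAGAGTGA"))
```

To check the whole record, the two lines of the nucleotide sequence can be joined into one string and passed to `translate`; under the assumptions above, the result should equal the protein sequence including its final `-`.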