Monarch geneset OGS2.0

DPOGS206885
TranscriptDPOGS206885-TA3285 bp
ProteinDPOGS206885-PA1094 aa
Genomic positionDPSCF300001 - 2019202-2039294
RNAseq coverage208x (Rank: top 46%)
Annotation
HeliconiusHMEL0068760.087.10% 
BombyxBGIBMGA012843-TA0.087.02% 
DrosophilaKdm2-PB0.068.07% 
EBI UniRef50UniRef50_D2A5L60.055.50%Putative uncharacterized protein GLEAN_15153 n=2 Tax=Tribolium castaneum RepID=D2A5L6_TRICA
NCBI RefSeqXP_970863.20.056.02%PREDICTED: similar to F-box and leucine-rich repeat protein 11 [Tribolium castaneum]
NCBI nr blastpgi|1892383000.056.02%PREDICTED: similar to F-box and leucine-rich repeat protein 11 [Tribolium castaneum]
NCBI nr blastxgi|1892383000.056.20%PREDICTED: similar to F-box and leucine-rich repeat protein 11 [Tribolium castaneum]
Group
Gene OntologyGO:00055154.7e-36protein binding
GO:00036775.9e-14DNA binding
GO:00082705.9e-14zinc ion binding
KEGG pathway 
InterPro domain[225-393] IPR0033474.7e-36Transcription factor jumonji/aspartyl beta-hydroxylase
[554-598] IPR0028575.9e-14Zinc finger, CXXC-type
Orthology groupMCL11417 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206885-TA
ATGTCGGAAACACTGCCGAATAACAAAAAACAAGTTCCAAGGCAACTGATCGCAGCATTGCGATGTAGCGACGTTGAGAATGAACGCCGCGTAAATTGCGCACAGCAGCAACGTCGTTATCGGGGCATACCTTGTGGGCATCTGGTAGGGACGAAGGCGTTATCCTCCCGCGACTGCGGAGGGCAGGGTGCCTGGGGCAGAGAAAGGAAACAGCGTAAGCTCTATTCAGATGAGTGGGCGCTGGGTGACGACGAGGCGGAGGGCGGGCGGGGCTTCTCGCTCGCTGACAAGTTGGAGAGTCCGCGCTTCGAGCATGCTGGGGCTGTACTGGAGATGAACGGTGCTGAACTTACAGTGGCCTACCTCCAGAAACATGGCTTCACAACCCCATTGCTTTTCAAAGAGAAACTTGGGTTGGGTTTAAGGCACACTCGACAGGCAGCCAATGTTGAAGTACCAAACCCTACTGTGGGGATTAAAATGAGCGGATCTGAATTAGTGCCAACCAGCAACTTCACAGTTAACGATGTGCGCATGTGCGTCGGCTCGAGGCGGCTTCTCGACGTCATGGATGTTAACACACAGAAAAATATTGAAATGACAATGAAGGATTGGCAGCGTTACTATGACGACGAGAACAAGGAGAGGCTGCTGAATGTAATCTCGTTGGAGTTCTCACACACACGTCTCGAGAACTACGTCCAGGCGCCGCGTATTGTGAGACTTATTGACTGGGTGGATACAGTCTGGCCGAGACATTTGAAAGATCAACAGACTGAATCGACGAACGCGTTGGATGAAATGATGTACCCGAAGGTCCAGAAGTATTGTCTCATGTCCGTCAAAGGTTGTTACACGGATTTCCACATCGACTTCGGCGGCACGTCAGTCTGGTACCACATACTGAGAGGAGCAAAAGTATTCTGGCTTATTCCACCCACTGAAAAGAATCTCCAGCTGTACGAGAAGTGGGTGCTATCCGGGAAGCAATCGGACGTTTTCTTCGGTGACACGGTCGAAAAATGTATAAGAGTTCATTTACAAGCCGGTTACACATTCTTCATACCGACCGGTTGGATACACGCTGTATACACACCGAGTGATTCACTCGTCTTCGGAGGCAACTTCCTGCATCAGTTCGGTATTGAGAAACAGTTGAAGATAGCTCAAGTCGAAGACGTCACTAAGGTGCCTCAGAAGTTTCGTTATCCGTTTTTCACTGAGATGCTGTGGTATGTACTTGACCGGTACGTGAGCGCGCTTTTGGGGCGGTCGCACCTCGCTCAAGAGGGACAGCCCGCGCCCACGACGCCCACGACGCCCACGCCGCCCGCGCCGCCCAAGGAACATGTTCACCTCACGCAGAACGAACTTCACGGACTAAAAGCTATAGTGGTATATCTTCATCAGCTCCCAGCAGCCCGTAAGTCAGTGCCCGAACTGCTAACGGATCCAATAGCATTGGTCCGTGATGTCCGTACTCTAGTGGAACAGCACAGGCATGATAAACAACAGCTAGCCATCACTGGTCTCCCATTACTTAAAGGTCCAGACGAGCCGTTGTCCGGAGAGCGTCGTCAGTCAGGAGGTCGGGGAGGTCGTGGTGGTCATCGTGGTGGTCCCGCGCGACCACGGCCCGACCACGCCGCCAACGCGCCACGTCGCAGGCGGACCAGATGCAAAAAATGTGAAGCCTGCCAGCGTACAGACTGTGGTGATTGCGTGTTTTGTCATGACATGGTTAAGTTCGGTGGTTTGGGGCGAGCTAAACAGACTTGCATCATGAGACAGTGTCTACAGCCAATGCTACCAGTGACAGCTTCCTGCGCCGTGTGTCATCTGGACGGATGGATGCAGACACCTGTAGCACCGCAGGCTAAAGGCAACTCAGGTCGTAACGGTCCATCAGTCCTGCGGGAATGCTCTGTTTGCTACAGCATAGTTCATCCAGCGTGTGTCCCTCCCGGAGGACAGCTCAACGAGGACCTGCCTAACTCATGGGAGTGCCCCACATGCACCACCATGGGTCTCAACCATGATTATAAGCCACGTCACTTCCGAGCCCGACAAAAATCTTCAGACTTGCGTCGCATGAGCGTTGGATCCGACGCCAGTCAGCTGAGTCAACAACATAACACCACAACGGCACAGACAAAATTACCAGCCCCACCTCCACCGGTGACTGTGAAGTCTGAGGCGGGATCGGACAATGAAAGCAAACCTGAAAGCACTAACGTTAAATCAGAAGAAGAAATAAAACGCGAAGAAGATCCATCAGAAGCACCCGAGGCGGAGGGAGTTGAGTGGGAGCCGACGGCGAAGAAACGACGAGCCAGCGATGAAGACGAGGCGCCTAAGAAACAAGCACTGAGAGCACATCTGGCGCTGCAGCTGACACACCATTCAGCTAAAGCTTTAAAGAAACCAATATATCCTGTACGTCCGGCTCCTCTGAACGTGGCCAATGTGAGCGGTGCGTGGCTGGATCGAGGGGCGATGCTGCGTGTCTTCGCTAAGCTCACGCCACACGAACTGGCTACATGTGCCCTGGTCTGCAAGGCTTGGGCAGAGTATTCAATGGACCCATCCCTATGGCGAAGTATGTCATTCGTTCAGGTGCGAGTGTCGGCGGCGCAGTTGGCGGGCATAGCAAGACGACAGCCGCTCAGTCTGGGACTCGAGTGGTGTCATCTAGCGAGACGACAACTCGCCTGGTTATTGGCTCGACTACCAGCCTTACGGTCGTTATCCTTGGCTGGTTCCCCGGCAGAGGCCGCTTTAGCTCTTCGCTCCAGTACATGTCCGCCCTTGACAGCCCTGGATTTATCTTTCGCTAGATGTTTAGATGATGCTAAATTACGAGGAATTCTAGCACCTCCTGAGAACTCTAGACCGGGAGGCGGAGCCAGGTCAGGAACCGAGCCTTCAAGGTTGGCAGCACTGGCTGTTCTACGTCTACCAGGAACTGACATCACCGACGTTGCCATGTTGTATATTGTACAGGCTCTTCCCAAGTTATGCGAGCTGGATGCTTCATCCTGTGCCCGTCTAACAGACGCTGGCGCGGCTCAGCTCGCGCTGCACGGCTTGCAGCGACTTTCGCTGGCGGGCTGCAGGCTGCTAACAGAGGCCGCATTGGACCATCTCGCAAGATGCCCTAACCTAGTTAGGCTGGACCTCCGACATGTACCACTTGTGTCTACACAGGCTGTCATCAAGTTCGCAGCCAAAGCTAAACATAACCTCCATGTTAAGGATGTTAAGTTGGTGGAGTTGAGAACCTGA

Protein sequence:

>DPOGS206885-PA
MSETLPNNKKQVPRQLIAALRCSDVENERRVNCAQQQRRYRGIPCGHLVGTKALSSRDCGGQGAWGRERKQRKLYSDEWALGDDEAEGGRGFSLADKLESPRFEHAGAVLEMNGAELTVAYLQKHGFTTPLLFKEKLGLGLRHTRQAANVEVPNPTVGIKMSGSELVPTSNFTVNDVRMCVGSRRLLDVMDVNTQKNIEMTMKDWQRYYDDENKERLLNVISLEFSHTRLENYVQAPRIVRLIDWVDTVWPRHLKDQQTESTNALDEMMYPKVQKYCLMSVKGCYTDFHIDFGGTSVWYHILRGAKVFWLIPPTEKNLQLYEKWVLSGKQSDVFFGDTVEKCIRVHLQAGYTFFIPTGWIHAVYTPSDSLVFGGNFLHQFGIEKQLKIAQVEDVTKVPQKFRYPFFTEMLWYVLDRYVSALLGRSHLAQEGQPAPTTPTTPTPPAPPKEHVHLTQNELHGLKAIVVYLHQLPAARKSVPELLTDPIALVRDVRTLVEQHRHDKQQLAITGLPLLKGPDEPLSGERRQSGGRGGRGGHRGGPARPRPDHAANAPRRRRTRCKKCEACQRTDCGDCVFCHDMVKFGGLGRAKQTCIMRQCLQPMLPVTASCAVCHLDGWMQTPVAPQAKGNSGRNGPSVLRECSVCYSIVHPACVPPGGQLNEDLPNSWECPTCTTMGLNHDYKPRHFRARQKSSDLRRMSVGSDASQLSQQHNTTTAQTKLPAPPPPVTVKSEAGSDNESKPESTNVKSEEEIKREEDPSEAPEAEGVEWEPTAKKRRASDEDEAPKKQALRAHLALQLTHHSAKALKKPIYPVRPAPLNVANVSGAWLDRGAMLRVFAKLTPHELATCALVCKAWAEYSMDPSLWRSMSFVQVRVSAAQLAGIARRQPLSLGLEWCHLARRQLAWLLARLPALRSLSLAGSPAEAALALRSSTCPPLTALDLSFARCLDDAKLRGILAPPENSRPGGGARSGTEPSRLAALAVLRLPGTDITDVAMLYIVQALPKLCELDASSCARLTDAGAAQLALHGLQRLSLAGCRLLTEAALDHLARCPNLVRLDLRHVPLVSTQAVIKFAAKAKHNLHVKDVKLVELRT-