Monarch geneset OGS2.0

DPOGS212774
TranscriptDPOGS212774-TA2313 bp
ProteinDPOGS212774-PA770 aa
Genomic positionDPSCF300012 + 950951-956465
RNAseq coverage248x (Rank: top 42%)
Annotation
HeliconiusHMEL0141610.053.94% 
BombyxBGIBMGA002931-TA2e-1046.30% 
DrosophilaCG3815-PA1e-9436.51% 
EBI UniRef50UniRef50_D2A4S43e-12536.18%Putative uncharacterized protein GLEAN_15317 n=1 Tax=Tribolium castaneum RepID=D2A4S4_TRICA
NCBI RefSeqXP_001814014.16e-12636.18%PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
NCBI nr blastpgi|1892381101e-12436.18%PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
NCBI nr blastxgi|1700584712e-12635.73%conserved hypothetical protein [Culex quinquefasciatus]
Group
Gene OntologyGO:00055156.9e-13protein binding
GO:00082706.9e-13zinc ion binding
KEGG pathwayecb:1000569711e-11 
 K10603 (AIRE)maps-> Ubiquitin mediated proteolysis
    Primary immunodeficiency
InterPro domain[46-104] IPR0130835.6e-15Zinc finger, RING/FYVE/PHD-type
[36-103] IPR0110113e-14Zinc finger, FYVE/PHD-type
[53-98] IPR0019656.9e-13Zinc finger, PHD-type
[54-99] IPR0197871e-12Zinc finger, PHD-finger
Orthology groupMCL12804 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212774-TA
ATGTCTAACGTTAGCTATGATCTTGATACATCCGGCGGACTTATGCCGCTTATAAGAGCACTAATTAAGCCGCCCGACGAAGATTTAAATGCTTCGAAACCTAAGAAACCTCAGCATCCATATTATAAGCGGCCTGGTAAAGGCCACAACCACGACTCGTGCGACGCGTGTAGGGAAGGTGGCGATTTGATTTGTTGTGACCGTTGCCCGGCTAGCTTCCATCTTGGCTGCTATGATCCGCCGCTCGAGGAAAATGATATCCCTGCTGGGTCGTGGCTGTGTCGGGAGTGTAAGGCCGGTGACGAGAAGCAGGGTGTTGTACGCTCGATTAGGTTGCAGTCACCAACGGAAAAGACAGAAGGAGAGAAGAAATCTAGATCTCTTCGTAACAGTCGTACAAACTCGCTCAATAAGAAGAAAGTCAAAGATGACAACAAAGAAAAGGAAAAAGAGAAGGAAAATGAAGAAGTGAAGGAGAAAGAGAAGGAGCCGGAGCCGGGGAAGGAGTTATCACCGATGGAGATACTTGTCAAAGCGGCCAAGGTCATGAACCCTAAACAGTTTGAGCTGCCCAGGGAGATGAAGATCCCGTGTAACTTTCCCGGCACCGAGAAAGATGGTAAATCCTCGAGTGGTATAGTGACTGTGGACGCGTGGGGATGTGTGCCGCTCCCGGCCCGCTCGTGTTTCGTTTGTCGCGGAACCTGCAAGATGGCGCCACTGCTGCAGTGTGACTACTGCCCTTTATTATTCCATCAGGACTGTCTCGAACCACCGCTGACATCATTACCCACAGGCAGGTGGATGTGTCCCAATCATGTTGAGCAGTATATAGATTGGAAACTGGTCAGTTCAATATCAGCGACCGAGCGTGCGGCGCTGTGGGATAAATTCAACGAGCCGGTGGATCAAGACGCCATCAAATGCGCCTTCATACGACGCGCGAGGACATACAGACCGGCCTTCCGTATAAAGGTGCCGTTGTCCAGTCGAGGTAGAGTCGTTGTGCCGAGCATGGTGAGAGCTCACTACAGTCGACCGCCGCCTCTACTACCGCCGCCTCTACTGCCATCTAGAAGGGACTACGTTCGCTGCACTAACGTCATCAGGAAGCTCAAATCTGGCGGCGACTACTGTGACTCTGAAGGTGAGGCGCCCTACAAGATATGTATGAACCTGAGCTGTCCTCAGTACAGCGGGGGTGAATGCCCTCTGGATACGCCCAAATTGAAGGGCGCCACTAACGAGGACGCCGACGAAGATCTCAAGGAAATAGAAGACAGGCAGAAGGTGACGGCTGCAGAATCTAATAATGAGAAGAATAACGCGTCGGAAGCCAGTGACATCGACTGCGACCTGGAGAAGATTACGGTGAAGAAGCGCAAGGTCTCTTCGGAGATTGCTTCTAATAAGAGAATGAAGCTGGAGAAGTTGCAGCTGAAGGAGGAGGAGGGAGTGCAGGAACTTTTGGACGCCGTAGAGGAACAGCTGGAACAGATCGATGACCGGCTCGTGAAACTGCTGGCCTGGCAGAGGTTGCAGCAGATAGCTGCTGGCGAGGCTGTTTCCGGTCGTTGGCGGCACGCGCCCCCCCCTGGTCAGGTGGGCAGGGCTGCCATCGCTCTCAGCTCCGCCTCCCGAGCCTCGCTGGCTAAGCTGGGGGTGAAGACGGTACCCTTACCGTCCGACCTTCTGGCCCGGGAGGACAGGGACCGGATCGCCCGGCTCGTGTTTGGCGCCGCACCCGCGCCGCCCCCCGAACCCGCACCCCGTGACCGGCTCGCCAACTCTCTGGTCAGGGCTGTTCTCTGTCATGTGAAGTCGTTAGACGGCGAGGGTAACTCCACACTAAGCTCCACGGTCGCCATGCGCGGCTCGTCGTTGGTAGCTGGCGTGGACAGCGCCTGCGACCTCCGGCTGGACCCCGTGTGCTGCAAGGTGTCCGAGATACATGCCAGGATATTCAGCGACGAGGTAACCGGTCACTTCGAACTGATCAACTACTCGGAGTGGGGTACGCGAGTGAACGGAGTTGTATACGCCGCGGACGTGTCACAGACACGCGGCCAGGAAGAGGATGACAGGGCGGAGGCTCTCAGGGATATAGTGAGGTCGAGGGGGGTGAAGCTGCCGCGCATTTCCGGTCCACTGACGGAGGCAGCTTCCGGCGGCAGGTGCTCCTGTTCGTGGAAGGGCGCAGCGCCCGGCGAGGGCGGCGCCTGGGAAGGTTCGGCACTGTTACCACACGGAGCCCTAATACAGTTTGGCTGTCAGATGTACGTGTTCAGTATAACAGACCACAAAACTTCCTAA

Protein sequence:

>DPOGS212774-PA
MSNVSYDLDTSGGLMPLIRALIKPPDEDLNASKPKKPQHPYYKRPGKGHNHDSCDACREGGDLICCDRCPASFHLGCYDPPLEENDIPAGSWLCRECKAGDEKQGVVRSIRLQSPTEKTEGEKKSRSLRNSRTNSLNKKKVKDDNKEKEKEKENEEVKEKEKEPEPGKELSPMEILVKAAKVMNPKQFELPREMKIPCNFPGTEKDGKSSSGIVTVDAWGCVPLPARSCFVCRGTCKMAPLLQCDYCPLLFHQDCLEPPLTSLPTGRWMCPNHVEQYIDWKLVSSISATERAALWDKFNEPVDQDAIKCAFIRRARTYRPAFRIKVPLSSRGRVVVPSMVRAHYSRPPPLLPPPLLPSRRDYVRCTNVIRKLKSGGDYCDSEGEAPYKICMNLSCPQYSGGECPLDTPKLKGATNEDADEDLKEIEDRQKVTAAESNNEKNNASEASDIDCDLEKITVKKRKVSSEIASNKRMKLEKLQLKEEEGVQELLDAVEEQLEQIDDRLVKLLAWQRLQQIAAGEAVSGRWRHAPPPGQVGRAAIALSSASRASLAKLGVKTVPLPSDLLAREDRDRIARLVFGAAPAPPPEPAPRDRLANSLVRAVLCHVKSLDGEGNSTLSSTVAMRGSSLVAGVDSACDLRLDPVCCKVSEIHARIFSDEVTGHFELINYSEWGTRVNGVVYAADVSQTRGQEEDDRAEALRDIVRSRGVKLPRISGPLTEAASGGRCSCSWKGAAPGEGGAWEGSALLPHGALIQFGCQMYVFSITDHKTS-