Monarch geneset OGS2.0

DPOGS201569
Transcript: DPOGS201569-TA  1917 bp
Protein: DPOGS201569-PA  638 aa
Genomic position: DPSCF300201 + 126446-135524
RNAseq coverage: 42x (Rank: top 72%)
Annotation
Heliconius: HMEL010921  2e-119  46.50%
Bombyx: BGIBMGA006117-TA  2e-125  52.87%
Drosophila: CG8419-PA  5e-80  30.10%
EBI UniRef50: UniRef50_D6WLM7  6e-92  33.59%  Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WLM7_TRICA
NCBI RefSeq: XP_002078605.1  1e-79  30.25%  GD23513 [Drosophila simulans]
NCBI nr blastp: gi|270006980  2e-91  33.59%  hypothetical protein TcasGA2_TC013417 [Tribolium castaneum]
NCBI nr blastx: gi|270006980  2e-96  34.20%  hypothetical protein TcasGA2_TC013417 [Tribolium castaneum]
Group:
Gene Ontology:
  GO:0008270  1.2e-09  zinc ion binding
  GO:0005622  1.2e-09  intracellular
  GO:0005515  1.4e-05  protein binding
KEGG pathway:
InterPro domain:
  [456-558]  IPR014756  5.4e-13  Immunoglobulin E-set
  [458-554]  IPR017868  1e-09  Filamin/ABP280 repeat-like
  [253-288]  IPR000315  1.2e-09  Zinc finger, B-box
  [458-557]  IPR013783  2.2e-09  Immunoglobulin-like fold
  [160-176]  IPR013083  5.7e-08  Zinc finger, RING/FYVE/PHD-type
Orthology group: MCL16462  Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201569-TA
ATGGAAGTAGACCGTATTTTATATATTTTCGGAAGCTTCTCCAGGAAGAAGAGGGAGAGAAGGAAATCCCTGGAGCCATCGTCGATCAGCGCTGGGAATAGTCCTTTGAAAAGTAACTCCAAGTCCAAACTGTCTCGGCCATCAAGTCAGAATGCTATCGCGCCTAGAACTTCTGCGTATGAGGAGAAGAGGAAGGCAAAACTGAAGGAATGGGAGTGTGGTATATGCAAGAGCGAGCTGGTGGAACCCAGACTACTCGCGTGTCTTCATAGTTTCTGTACCGACTGTCTCTTGGGGCTACACTGTGAGGGGGATGATCTATGGGAGGATAACGAAGGATGTCCGGAACATGACGTTGGTTTTGGCAGTGCATCAGCTGGAGGCTCCGGCGGCTCAGGCTACGAGACCCTGAGACATTCCGTCTCTGAAAGTAGTCTGGAGAAAATTAAATATGGGATCGTTTCTAGGAAGGTATCCGGTAAGAGCTATCATTTCGTGGTGTGTCCACTATGCGGTTCAGAAACTCAACTGCCTCTCGGTGGAGTGTCAGCACTGTCGCTGAACTACGTGCTGCTGCGGAGGATGACGAGCAGAGATGGCGACAGCGCTGTCCTCTGTGACCTCTGCTGTGCTGATAATAAGGCAGAGAGTCGCTGCAGCCAATGTCTTGTAAGCGTGTGTACATCGTGCGGGGATTCCCACACCCTGAAAAAGGGAAACGCGACACATCTTCTGGAACCCCTGGCACCATTCCTAAGGTTCTGCGGACAACATCCGAAAGTCGAGCTGACCGTTTACTGCGCTACTTGCCAGCAGGTGATTTGTCGCGATTGCAGTCTGATCTCCCACGGGGGACACGCTCTGGAGGGCGCGGGCAGGGCTGCGGCAGGGAAGGTGGCTGCCCTCAGGGACGCCATGCAGAGGGCTGGACTTGTACCAGACCACGTACAACGAGCTAACAGAATACTGGACGTACACGCCAGGGATATTGATGAGCAAGCTTCCCGCGTGGAGTCCGAGATCCGCTTGTGGTCGGAGGAGTATCGCAGATCGTTAGAGACACACGCGAGGGCGCTGTGTGCGGGGGCGGTGAGGGCGCGCGCTCGGTACAGAGCCAGAGCCACGAGACAGCTGCTACAGTTGGAGCAGAGAGCTGAACATGCCAGGGAGGCTGTCAAGTTCGCAGAGGAGCTATTAAGTGAGGGCAAGGAAGACGAGATCCTGTCCCTCAGCGGGCCGGTGCTGAAACGTCTCCAGAACCTCACGGAGCTGCAGCCTCTGTGCGAGGCGGCGCGGTGCGAGCTCCGGTTCGCCCCCAACGCGCCGGCCGCTCACAACCCCTCACTCGTCGGCAGGTTGTACACCATGGGACCAGACCCACAGCAATGTGTGCTTGATACTGACGGTCTACAAGATCTCCGAGTGGACTGCCAGCACACAGCTATATTAGAGTTACGTGACAGCAACGGAGATCGTATCTGGTGCGGCGGCGAGACGGTTTGTGGATATTTCCGTCGTCGGGACAGTTCGTCTCGGCCCGCCGCGGCTCGCGTGAGGCCGCGTGTCGATGGTTCATACGCCCTGCATGTAGCACCTCGAACACCTGGACACTACCTGCTGGCTGTCACAGTCGACAACCAGCCCATAAAGGGTAGTCCGTTCTCGTGTTCAGCTCGCCTGTCCAAGTCCCACTCGGGACAGTTCCATTGCTGTTCGTTCTGTTCGTCGGGAGGTCGCCGGGACGCCACTTGCGGCTGTGGATCCTCTATGGGCGGAGGTTATAAAGGCTGCGGTCACGGTCACGCGGGCTGGCCGGGGACGAGACACTGGTCCTGCTGCGGAGGGACGTCCCGCCACGCGCCATGTACCACCCGCACTACCAACGACACACCACACACATACCACGTGTCGCTCTGA

Protein sequence:

>DPOGS201569-PA
MEVDRILYIFGSFSRKKRERRKSLEPSSISAGNSPLKSNSKSKLSRPSSQNAIAPRTSAYEEKRKAKLKEWECGICKSELVEPRLLACLHSFCTDCLLGLHCEGDDLWEDNEGCPEHDVGFGSASAGGSGGSGYETLRHSVSESSLEKIKYGIVSRKVSGKSYHFVVCPLCGSETQLPLGGVSALSLNYVLLRRMTSRDGDSAVLCDLCCADNKAESRCSQCLVSVCTSCGDSHTLKKGNATHLLEPLAPFLRFCGQHPKVELTVYCATCQQVICRDCSLISHGGHALEGAGRAAAGKVAALRDAMQRAGLVPDHVQRANRILDVHARDIDEQASRVESEIRLWSEEYRRSLETHARALCAGAVRARARYRARATRQLLQLEQRAEHAREAVKFAEELLSEGKEDEILSLSGPVLKRLQNLTELQPLCEAARCELRFAPNAPAAHNPSLVGRLYTMGPDPQQCVLDTDGLQDLRVDCQHTAILELRDSNGDRIWCGGETVCGYFRRRDSSSRPAAARVRPRVDGSYALHVAPRTPGHYLLAVTVDNQPIKGSPFSCSARLSKSHSGQFHCCSFCSSGGRRDATCGCGSSMGGGYKGCGHGHAGWPGTRHWSCCGGTSRHAPCTTRTTNDTPHTYHVSL-
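
Consistency check on the listed lengths:

The lengths and coordinates in this record can be cross-checked with a little arithmetic. The sketch below is only illustrative. It assumes the transcript is a complete coding sequence whose final codon is a stop (the record does not say so explicitly, although the trailing "-" in the protein sequence suggests a translated stop), and that the genomic coordinates are 1-based and inclusive. Both function names are hypothetical and belong to no particular tool.

```python
def protein_length_from_cds(cds_length_bp: int) -> int:
    """Number of amino acids encoded by a complete CDS.

    Assumption: the length is a multiple of 3 and the last codon
    is a stop codon, which contributes no residue.
    """
    if cds_length_bp <= 0 or cds_length_bp % 3 != 0:
        raise ValueError("CDS length must be a positive multiple of 3")
    return cds_length_bp // 3 - 1


def genomic_span(start: int, end: int) -> int:
    """Length of a genomic interval.

    Assumption: coordinates are 1-based and inclusive.
    """
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Values taken from the record above.
print(protein_length_from_cds(1917))   # 638, matching the listed 638 aa
print(genomic_span(126446, 135524))    # 9079
```

Under these assumptions the 1917 bp transcript accounts for the 638 aa protein plus one stop codon. The genomic interval is considerably longer than the transcript, which would be consistent with the gene model containing introns.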