Monarch geneset OGS2.0

DPOGS202165
TranscriptDPOGS202165-TA2130 bp
ProteinDPOGS202165-PA709 aa
Genomic positionDPSCF300162 + 49615-57140
RNAseq coverage416x (Rank: top 29%)
Annotation
HeliconiusHMEL0036977e-12263.56% 
BombyxBGIBMGA003435-TA2e-9058.57% 
DrosophilaCG1965-PA5e-5430.51% 
EBI UniRef50UniRef50_D6WKW54e-7935.58%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WKW5_TRICA
NCBI RefSeqXP_967900.17e-8035.58%PREDICTED: similar to gc-rich sequence DNA-binding factor [Tribolium castaneum]
NCBI nr blastpgi|910828011e-7835.58%PREDICTED: similar to gc-rich sequence DNA-binding factor [Tribolium castaneum]
NCBI nr blastxgi|910828013e-9835.29%PREDICTED: similar to gc-rich sequence DNA-binding factor [Tribolium castaneum]
Group
Gene OntologyGO:00056346.9e-55nucleus
GO:00036776.9e-55DNA binding
GO:00063556.9e-55regulation of transcription, DNA-dependent
GO:00037006.9e-55sequence-specific DNA binding transcription factor activity
KEGG pathway 
InterPro domain[161-617] IPR0128906.9e-55GC-rich sequence DNA-binding factor
[512-621] IPR0227836.6e-13GC-rich sequence DNA-binding factor domain
Orthology groupMCL12336 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202165-TA
ATGTCCTTGTTTCGTAAACCGAAGAAGATCCAGAGACGAGTTTTTTGTGCTGACGATGAAGAAGACGGTGAGCCGGAGGCACCTGTGCCGCCGCCGCCGCCGATTATTAGTAATTCAAGGAAGGAAAACAAACAAGTAAAAGTAACAACGCTATTGAGTTTCGCCGATGAAGAGGAAGAGGGCGAAGTATTCAAGGTGAAGAAGTCATCACAGAGCAAGAGATTGAGTAAACGGAGACAGAAAGAAAAACAACGCACAGATGGTGATAGTAATAAATATGACAATCACATGGTCGAGGAGAAACCGTCGGAGGAGATAGAAGAACCGAGGAAGAAGGTTACCCTCGAGGGTCTGATCCTGTCAGGGCGGGAGGCGTTGTCCGCGGACGGGGCGGGGGACATTTCCGAAGACAGCGAGGAAGATAACAGGGGGTTCCACACGTACCGAGCCGAGAGCGTGCGGGCGGCGCTCGCCGGCGCGGGGGGAATCCCCGACGCCGCGCTCATACACGCCGCGCGCAAGACCCGACAGCAGGCTCGTGAGTTGGGTGACTTTGTTCCCATCAAGAATGATGGCGGCTCCAGGATGATGAGAGATGATGACGCTGATGACGATGACGATGATGAGGCAGACGAGGGCCGGATACAGGTCAGGGGGTTGGAACTGCCAAGCGACAGACCCGAACGTGGTACAACAGCCGCCGCGTCTGATGATGAAGCTCAAAGTGAAGGAGAGGAGTGGGAGGAGCAGCAGATTAAGAAAGCTGTGCCCTCAATAGCTGATATTACAGGTGATTGTATCCCACTAAATCCGTTCGCTGTTCCTCCGCCCCCGGACACGCCGCGTCACCTGCGGTCCCTCGCGCGCCCCGGACAGCCTCCGCCAGCTACCGCGCAACAACTCGTAGAGGCGCTACGAGACAGGCTGTCAGAGCTTCACGAGAGTCGTGCGAGAACAGCGCAGCGTATGTATCACTTACAAGAGCGAGCGTCTAACGCGGCCGCCAAGCGTGAGAGGTGTAAGGGGTTGTGCTCGGAACTCGACCGCAGATACAAGAGGGCGCAGGCGGCCAGGGGGTACATCACCGACCTCGTGGAGTGTCTGGACGAGAAGATACCTCAACTGCAAGCGTTGGAGGCCCGGGCCCTGGCGCTTCATCGCAAGAGACGCGACCTGCTGGTGGAGAGGCGGAGGGCCGACGTGCGCGACCAGGCGCAGGACGTGCTCGCACTCGCCGCTCGCGCGGGGTCATCGAAGCCGGTGGACAGCGAGGAGAAGCGTCAGCGTACTGCGGAGCGCGAGGGCCGGCGGCGGGCGAGGCGGCTCAGGCGACAGGCGGCCGGCAACAACCAGCACAGGGACGGGGACTCCAGCGACGATGACCTGCCCCCAACCCTGCACCATCACTGTCAACAGGAGGCGGACGCGATCCGCTCTCTGTCGAGTCAGTTGTTCGCGGACACGTTGTCGGCCTACCGCAGTGTGCAAGGAGTCTGCGGTCGCATGGCAAGACTGAGGCGGACGCGCGGGTTGTACACGGACGCGTATGTCGCACAATGTCTGCCGAAGTTACTGGCGCCGTATGTTAGACATCAGCTGATCCTCTGGAACCCGCTCGCTGACGAAGACAACGAGGACTACGAGAAGATGGACTGGTACAAATGTCTCATGATGTACGGCTGCAAGTCCGAGCGCCTGTCCAGCGACTCAGAGCAGTCTTCCTCCGACGAGGTTTCCGTGACCGAGCTGGCCGTGAGAGACGACCCCGACCTACTGCTGGTACCCACGATCATAGACAAGGTCGTACTGCCCAAGATCACCGAGCTGGTGGAGCACGCGTGGGATCCGATGATGGTCCGCGCGTGTGTTCGCCTGCGTCAGCTATTGGAGCGGGCCGGTCGCCTGCCCGTGCGGAGCGCCCTGCCTCGCCTGGCGACCACCGCCCGCGCCGTACTCACGGCCTCGATCAACGCTGATGTCTTCCTACCTACACTACCGCCTCAGTACNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNGCCGCACATTCTGTAG

Protein sequence:

>DPOGS202165-PA
MSLFRKPKKIQRRVFCADDEEDGEPEAPVPPPPPIISNSRKENKQVKVTTLLSFADEEEEGEVFKVKKSSQSKRLSKRRQKEKQRTDGDSNKYDNHMVEEKPSEEIEEPRKKVTLEGLILSGREALSADGAGDISEDSEEDNRGFHTYRAESVRAALAGAGGIPDAALIHAARKTRQQARELGDFVPIKNDGGSRMMRDDDADDDDDDEADEGRIQVRGLELPSDRPERGTTAAASDDEAQSEGEEWEEQQIKKAVPSIADITGDCIPLNPFAVPPPPDTPRHLRSLARPGQPPPATAQQLVEALRDRLSELHESRARTAQRMYHLQERASNAAAKRERCKGLCSELDRRYKRAQAARGYITDLVECLDEKIPQLQALEARALALHRKRRDLLVERRRADVRDQAQDVLALAARAGSSKPVDSEEKRQRTAEREGRRRARRLRRQAAGNNQHRDGDSSDDDLPPTLHHHCQQEADAIRSLSSQLFADTLSAYRSVQGVCGRMARLRRTRGLYTDAYVAQCLPKLLAPYVRHQLILWNPLADEDNEDYEKMDWYKCLMMYGCKSERLSSDSEQSSSDEVSVTELAVRDDPDLLLVPTIIDKVVLPKITELVEHAWDPMMVRACVRLRQLLERAGRLPVRSALPRLATTARAVLTASINADVFLPTLPPQYXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPHIL-