Monarch geneset OGS2.0

DPOGS207008
Transcript: DPOGS207008-TA, 2364 bp
Protein: DPOGS207008-PA, 787 aa
Genomic position: DPSCF300001 + 1107908-1114955
RNAseq coverage: 125x (Rank: top 57%)

Annotation
Heliconius: HMEL014373, 0.0, 88.28%
Bombyx: BGIBMGA012932-TA, 0.0, 81.11%
Drosophila: CG3711-PA, 0.0, 56.99%
EBI UniRef50: UniRef50_E9I9G1, 0.0, 59.69%, Putative uncharacterized protein (Fragment) n=4 Tax=Coelomata RepID=E9I9G1_SOLIN
NCBI RefSeq: XP_001607546.1, 0.0, 64.18%, PREDICTED: similar to leucine-zipper-like transcriptional regulator 1 (LZTR-1) [Nasonia vitripennis]
NCBI nr blastp: gi|350419978, 0.0, 63.36%, PREDICTED: leucine-zipper-like transcriptional regulator 1-like [Bombus impatiens]
NCBI nr blastx: gi|350419978, 0.0, 63.36%, PREDICTED: leucine-zipper-like transcriptional regulator 1-like [Bombus impatiens]
Group:
Gene Ontology: GO:0005515, 5.3e-45, protein binding
KEGG pathway:
InterPro domain: [124-344] IPR015915, 5.3e-45, Kelch-type beta propeller
                 [336-495] IPR011333, 8.6e-24, BTB/POZ fold
                 [365-495] IPR000210, 8.6e-23, BTB/POZ-like
                 [359-494] IPR013069, 1.7e-17, BTB/POZ
                 [42-82] IPR006652, 1.6e-08, Kelch repeat type 1
Orthology group: MCL13343, Single-copy universal gene
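
The InterPro entries above are residue ranges on the 787 aa protein, and several of them overlap (the three BTB/POZ entries cover nearly the same region). A minimal sketch for checking range lengths and overlaps follows. It assumes the coordinates are 1-based and inclusive, which the record does not state, and the helper names are hypothetical, chosen for illustration only.

```python
# InterPro hits copied from the record: (start, end, accession, name).
# Assumption: coordinates are 1-based and inclusive on the protein sequence.
DOMAINS = [
    (124, 344, "IPR015915", "Kelch-type beta propeller"),
    (336, 495, "IPR011333", "BTB/POZ fold"),
    (365, 495, "IPR000210", "BTB/POZ-like"),
    (359, 494, "IPR013069", "BTB/POZ"),
    (42, 82, "IPR006652", "Kelch repeat type 1"),
]
PROTEIN_LENGTH_AA = 787  # from the "Protein" field of the record


def domain_length(start: int, end: int) -> int:
    """Number of residues in an inclusive range."""
    return end - start + 1


def overlaps(a: tuple, b: tuple) -> bool:
    """True if two inclusive ranges share at least one residue."""
    return a[0] <= b[1] and b[0] <= a[1]


for start, end, accession, name in DOMAINS:
    # Every hit should lie inside the protein.
    assert 1 <= start <= end <= PROTEIN_LENGTH_AA
    print(f"{accession} {name}: {domain_length(start, end)} residues")
```

Under the same assumption, `overlaps(DOMAINS[0], DOMAINS[1])` reports whether the Kelch-type beta propeller range and the BTB/POZ fold range share any residues.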

Nucleotide sequence:

>DPOGS207008-TA
ATGATAACAGAGCAACTAAGAAATCCCAGTGAGTTAGATATCAGCTTGAGGATGGAATTTGGGCCATTTGAAACAGTTCATAAATGGAAAAGGATGTCTGAATGTTATGAATTTGTAGGGGCAAGACGAAGTAAGCACACAGCAGTAGCATATAAAGACGCAATATATGTATTTGGTGGAGACAATGGGAAGTCTATGTTGAATGATCTGATTAGATTTGATATAAGAGAGAAGTCTTGGACTAAAACTGGAGGCATGGGGACGCCACCAGCCCCAAGATATCACCATTCAGCTGTAGTGCATAGATCTTCAATGTTTGTGTTCGGTGGATACACCGGGGATATATTGGCCAATTCTAATCTGACAAATAAAAATGATCTCTTTGAATACAAATTTCAAAGTGCTCAATGGGTACAATGGAAATTCACTGGCCAAGAGCCTGTGCCTCGTTCTGCACATGGAGCAGCTGTGTATGATGATAAATTGTGGATATTTGCCGGCTATGATGGTAATGCGAGGTTAAATGACATGTGGACTATAAATCTAGTGGGTGAAAATCATCAATGGGAAAGGATAGAACAAAAAGGTGAATGTCCACCCACCTGTTGTAATTTCCCAGTTGCTGTGGCCCGTGGGAAGATGTTTGTTTTCAGCGGACAAAGCGGTGCTAAGATCACCAATGCACTGTTCCAATTCGACTTCGAGACGCATACCTGGAGTCGTGTATGCACCGAACACCTGCTACGTAGCGCTGGACCAGCGCCCGCACGCCGATACGGTCACGTGATGCTGCACCATGCGAGACATCTCTACGTATTCGGCGGCGCCGCTGACAGTACTCTGCCCTCCGACTTGCACTGCTACGACCTTGATACGCAGATGTGGTCCGTTGTACATCCGGCGCCGGATTCTCAAATCCCTTCCGGTCGGCTATTCCACGCAGGTGCGGTGGTGGAAGACGCCATGTACATTTTCGGCGGCACCGTCGACAACAACGTGCGGAGCGGCGATCTGTATAGATTTCAACTCTCCAACTATCCCCGTTGCACTTTGCACGACGATTTCGGACGGATTCTTCGGTCGCAGCAGTTCTGCGACGTGACGCTGCTGTTGGGTGGCGACCAGGTCGCGTTTTACGCCCACCAGGCTATGCTGGCCGCTCGTTCGCAATACCTACGGATCAAAATAAAGGAAGCCAAAGAGGATCTAGCGAGACGCATCGCCGCTGGTGAAGAAGAGGCTGCCGAGGAGTTCTCATACAAATCAACGCCTCAGTTGACTGTGAAGTTACCAGAAGCTACCCCAGAAGCCTTCAGGATGGTACTGAATTATATTTACACAGACAGGATCGATCCAACAGAAAAGGACGAGAACCCGGCATCGCCCGCCACCATCTTGCTAGTGATGGAAGTTCTACGGCTGGCACTGCGTCTGAACATACCACGTCTCAGAGGTCTCTGCGCACGTTTCCTCCGAGCTAATCTCTGCTACAGCAACGTCCTGCAAGCACTGCACGCTGCCCACCACGCTAACCTTATCTGCATAAAGGAATACTGTCTCAGATTCGTGGTCAAGGAGTACAACTTCACAGCGATCGTGATGTCGTCGGAGTTCGAGCAAATGGACCAGCGTCTGATGGTTGAAGTGATAAGACGGAGACAGCAGCCGCTCCATAAACTGACTGCCAACAACGAGCACGAGGAGGAGGTCGTCGGTACAACGTTAGAACAGGATATGTGTGTGTTAGTGAGCGGCGGTGGACACGAACTGGCTGATGTCAAGCTGAGGGTAGGGTGTGCTATGAGACCAGCTCACAGGTCGATACTGGCCGCCAGGGCTTCCTACTTTGAAGCCATGTTCAGATCCTTTTCACCGCAGGACAATGTTGTTAACATCCAAATATGTGACACGGTACCGTCGGAGGAAGCGTTCGATTCACTACTAAGGTACATCTACTACGGCGACACTAATATGCCTACAGAAGATTCACTCTATCTATT
CCAGGCGCCTATATACTATGGTTTCACAAACAACCGACTGCAGGTGTTCTGCAAACATAATCTACAAAGCAATGTCACTCCGGAAAATGTGGTCGCTATCCTACAAGCGGCGGACAGAATGAGGGCAGCCGACATTAAAGAATACGCCCTCAAAATGATCGTACATCACTTCCAACTGGTAGCTCGTCAGGAAGTTATCAAGAATTTAGCCCAGCCCCTACTAGTGGACATTATCTGGGCTCTAGCCGAGGAACCTCAGGCTGATACACCGCTGCCCTTACAGCCACGATCGCTTCCATCGTCGTCCTCAGCAGACACTCTCACCGACGACCCTGATTATATCAAGCACAAAAACAATAAATGA

Protein sequence:

>DPOGS207008-PA
MITEQLRNPSELDISLRMEFGPFETVHKWKRMSECYEFVGARRSKHTAVAYKDAIYVFGGDNGKSMLNDLIRFDIREKSWTKTGGMGTPPAPRYHHSAVVHRSSMFVFGGYTGDILANSNLTNKNDLFEYKFQSAQWVQWKFTGQEPVPRSAHGAAVYDDKLWIFAGYDGNARLNDMWTINLVGENHQWERIEQKGECPPTCCNFPVAVARGKMFVFSGQSGAKITNALFQFDFETHTWSRVCTEHLLRSAGPAPARRYGHVMLHHARHLYVFGGAADSTLPSDLHCYDLDTQMWSVVHPAPDSQIPSGRLFHAGAVVEDAMYIFGGTVDNNVRSGDLYRFQLSNYPRCTLHDDFGRILRSQQFCDVTLLLGGDQVAFYAHQAMLAARSQYLRIKIKEAKEDLARRIAAGEEEAAEEFSYKSTPQLTVKLPEATPEAFRMVLNYIYTDRIDPTEKDENPASPATILLVMEVLRLALRLNIPRLRGLCARFLRANLCYSNVLQALHAAHHANLICIKEYCLRFVVKEYNFTAIVMSSEFEQMDQRLMVEVIRRRQQPLHKLTANNEHEEEVVGTTLEQDMCVLVSGGGHELADVKLRVGCAMRPAHRSILAARASYFEAMFRSFSPQDNVVNIQICDTVPSEEAFDSLLRYIYYGDTNMPTEDSLYLFQAPIYYGFTNNRLQVFCKHNLQSNVTPENVVAILQAADRMRAADIKEYALKMIVHHFQLVARQEVIKNLAQPLLVDIIWALAEEPQADTPLPLQPRSLPSSSSADTLTDDPDYIKHKNNK-
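
The lengths reported in the record agree with each other: a 787 aa protein plus one stop codon (shown as the trailing `-` in the protein sequence) needs 788 codons, which is 2364 bp, the stated transcript length. The sketch below reproduces that arithmetic. It assumes the transcript is a complete coding sequence with no untranslated regions, and the function name is hypothetical.

```python
def expected_cds_length(protein_length_aa: int, has_stop_codon: bool = True) -> int:
    """Return the coding-sequence length in bp implied by a protein length.

    Each amino acid is encoded by one codon of 3 nucleotides; a stop codon,
    if present, adds one more codon that is not translated.
    """
    codons = protein_length_aa + (1 if has_stop_codon else 0)
    return codons * 3


# Values taken from the "Transcript" and "Protein" fields of the record.
TRANSCRIPT_BP = 2364
PROTEIN_AA = 787

# Assumption: the transcript contains only the CDS (start codon to stop codon).
assert expected_cds_length(PROTEIN_AA) == TRANSCRIPT_BP
```

The same check can be run on any other record by substituting its transcript and protein lengths. A mismatch would suggest untranslated regions, a partial model, or a truncated sequence.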