Monarch geneset OGS2.0

DPOGS204938
Transcript: DPOGS204938-TA, 1164 bp
Protein: DPOGS204938-PA, 387 aa
Genomic position: DPSCF300160 - 84643-90202
RNAseq coverage: 9x (Rank: top 85%)
Annotation
Heliconius: HMEL003639, 1e-120, 68.66%
Bombyx: BGIBMGA005281-TA, 1e-139, 78.53%
Drosophila: gprs-PA, 1e-176, 88.51%
EBI UniRef50: UniRef50_E0VVS0, 0.0, 91.45%, Putative uncharacterized protein n=2 Tax=Neoptera RepID=E0VVS0_PEDHC
NCBI RefSeq: XP_002430214.1, 0.0, 91.45%, conserved hypothetical protein [Pediculus humanus corporis]
NCBI nr blastp: gi|242019531, 0.0, 91.45%, conserved hypothetical protein [Pediculus humanus corporis]
NCBI nr blastx: gi|242019531, 0.0, 89.65%, conserved hypothetical protein [Pediculus humanus corporis]
Group
Gene Ontology: GO:0005515, 3.3e-18, protein binding
KEGG pathway 
InterPro domain: [134-197] IPR011333, 5.8e-23, BTB/POZ fold
    [53-201] IPR000210, 3.3e-18, BTB/POZ-like
    [137-198] IPR013069, 5.4e-10, BTB/POZ
    [221-315] IPR011705, 3.6e-06, BTB/Kelch-associated
Orthology group: MCL16429, Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204938-TA
ATGGATATTACAATGACCCGTCATTTCAGCTTAGATGAGTCGTCAATGAGAAGTGGAGCGTTAGTTGCAATGGAGGCGGAGCCGGACCTCAGTACTTTTGAAAACAAATCCGGACTAGCTGAAGATATGAAGTTTCTTGCCTCTATGCCTGAGCTCTGTGACGTCACATTTCTTGTTGGAGATACTCGAGAACCAGTATGTGCAGTCAAAGCAGTATTGGCAGCTCGTAGCAGAGTTTTCCAAAAAATGCTTTATCAAGCTCCGAGTCCACAAAGAAAAAAAGAACCAGCTCCTCGGGAAAATAAATTGAGATTATTTTTAAAGAGATCATCTGAACCTCTTCTTAATCTTCAGAATGCAGCTCAACAGAGATCGACTTTCGCGCAGCAATTGGCCCCAATTCAAGAGCCTTCTTCACAACAGCATCAGACTCTAATCATAGAAGAATTTGAGCCTGATGTTTTTCGTCAGCTTATCGAATACATTCATACTGGCTGCGTCACACTACAGCCCAGAACATTGTTAGGAGTAATGAACGCAGCCGATTATTATGGTTTGGATGAACTACGCCGCGCCTGCGCTGGTTTTGTCCAATGCTGTATCACTGTTGATACGGTGTGCGCTCTTCTAGCATCTGCAGAGAGATACATACAGTACAAATGTACCAAGTCTCTGGTTCAGAAGGTACTAGAGTTTGTAGATGAACATGGCAATGATGTGTTGAATTTGGGTTCATTTACCCTGCTTCCACAGCATGTTGTCAGGCTAATTTTGGCACGAGATGAGCTCCGAGCAGATGAGTTCACTAAATTTCAGGCTGCACTTATGTGGAGTAAAAAGTATTGTGATACAAATCCAAATATGATTTTGAAAGATGTTATCGGAAATTTTTTAGAATATATACAATTCCATAAGATTCCAGCAAATGTTTTAATGAGAGAAGTTCATCCTCTGGGATTGGTTCCCTACTCTATCATTATGAATGCTTTAGCTTATCAGGCAGACCCGGCCAGCGTAGATCCAGGCAAACTCTCTCCTGCGCGAATCCGGCGCGCTGGCCGTTCTATGTCCGTTCAATCATCGCTTGACCCCTACGGCTCAAACACCACCCTTTCCTCGACCGGATCTAGCGATGGTCCATCTTCAGACTCTCGGCACAACTAA

Protein sequence:

>DPOGS204938-PA
MDITMTRHFSLDESSMRSGALVAMEAEPDLSTFENKSGLAEDMKFLASMPELCDVTFLVGDTREPVCAVKAVLAARSRVFQKMLYQAPSPQRKKEPAPRENKLRLFLKRSSEPLLNLQNAAQQRSTFAQQLAPIQEPSSQQHQTLIIEEFEPDVFRQLIEYIHTGCVTLQPRTLLGVMNAADYYGLDELRRACAGFVQCCITVDTVCALLASAERYIQYKCTKSLVQKVLEFVDEHGNDVLNLGSFTLLPQHVVRLILARDELRADEFTKFQAALMWSKKYCDTNPNMILKDVIGNFLEYIQFHKIPANVLMREVHPLGLVPYSIIMNALAYQADPASVDPGKLSPARIRRAGRSMSVQSSLDPYGSNTTLSSTGSSDGPSSDSRHN-
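
Consistency check (illustrative sketch):

The transcript and protein lengths listed above agree with each other: 387 residues plus one stop codon make 388 codons, which is 1164 bp. The sketch below shows one way to check that the nucleotide record translates to the protein record. It assumes the standard genetic code and that the trailing "-" in the protein sequence marks the stop codon. The function name `translate` and its `stop_symbol` parameter are hypothetical helpers introduced here. They are not part of the geneset's own tooling.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence, writing stop codons as stop_symbol.

    Assumes an unambiguous A/C/G/T sequence whose length is a multiple of 3.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First eight codons of DPOGS204938-TA, followed by its terminal TAA codon.
fragment = "ATGGATATTACAATGACCCGTCAT" + "TAA"
print(translate(fragment))
```

To check the whole record, pass the full DPOGS204938-TA sequence to `translate` and compare the result with the DPOGS204938-PA sequence.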