Monarch geneset OGS2.0

DPOGS204267
TranscriptDPOGS204267-TA1332 bp
ProteinDPOGS204267-PA443 aa
Genomic positionDPSCF300046 - 168198-169529
RNAseq coverage295x (Rank: top 38%)
Annotation
HeliconiusHMEL0152000.093.00% 
BombyxBGIBMGA007536-TA0.087.36% 
DrosophilaCG5742-PA2e-16158.67% 
EBI UniRef50UniRef50_E2A9010.073.50%Ankyrin repeat domain-containing protein 13C n=1 Tax=Camponotus floridanus RepID=E2A901_CAMFO
NCBI RefSeqXP_001657940.10.076.66%hypothetical protein AaeL_AAEL006663 [Aedes aegypti]
NCBI nr blastpgi|1571140090.076.66%hypothetical protein AaeL_AAEL006663 [Aedes aegypti]
NCBI nr blastxgi|1571140090.076.66%hypothetical protein AaeL_AAEL006663 [Aedes aegypti]
Group
Gene OntologyGO:00055152.6e-05protein binding
KEGG pathway 
InterPro domain[158-433] IPR0218323.1e-73Protein of unknown function DUF3424
[9-98] IPR0206831.7e-18Ankyrin repeat-containing domain
Orthology groupMCL11896 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204267-TA
ATGTCTGAATTGATATGTAGCAGCGAAAATGAAGTTTATCCACTTCACGAATGTGTTTTTATGGGAGACGTGCGAAAATTGTCGTCTTTACTAAGGTTCAATGATGTGACAAGAAAAGATAAACATGGAAACACAGCACTGCACCTTGCTGTTATGCTAGGTCGCAAAGAATGCGTGCAACTTTTGTTAGCTCATGGTGCGCCGGTAAAGGTTAAAAACCTTGCAGGATGGTCCCCTCTTGCAGAGGCCATCAGCTATGGCGATCGCCAAACTATATCCTCTCTGGTACGCAAACTGAAACAGCAAGCTCGTGAACAGATGGAAGTCAGAAGACCAGATCTTATAAGGGCTTTGTCACAAATACAGAACTTTTATATGGAATTAAAATGGGATTTTCATTCTTGGGTACCGCTTGTGTCCAGAATATTGCCATCTGATGTATGCAAAATTTACAAATCGGGTTCTGGTATTAGATTGGATACAACACTAGTAGACTTTACTGATATGAAATGGGAAAGAGGGGATGTGTCATTTATTTTTAAAGGTGATAAACCTCCAAGTCAGTCACTAACAGTTCTGGATAACAAGGTCAAAGTGTACCAACATGTCCGCTATGAGGAAACAGAAAATGAAATAGAGGATGAAGTGGACTTGCTTATGTCAAGTGATATACTTGCTGCACAAATGTCAACCAAAGGCATCTCATTTTCTAGAGCTCAGTCGGGTTGGATCTTTCGTGAGGACAGGAAGGAGACTGTTGCTGGCTTATATAAAAGTAATATTTACACTATTTCAGGCCTCGTTTTAGAGTCACGAAAGAGAAGAGAACATTTATCAACTGATGATTTGCAAAAAAATAAAGCAATTATAGAAAGTTTAACTAAGGGTAACACACAAAACTTGGACACCAATGGTGAGCCAGTGAGAAGAGCATCTCTGAACCCCCCGCCTGAGAGTAGCATTGACTGGGAAACATACATATCATCATCCCCCGGAGAGTACCCCGGTCTAGGAAGGGAGCTTGTGTACAAAGAATCATCCAGGAATTTCCGAGCTACAATTGCTATGAGCGATGATTTTCCTCTGAGTGTTGATATGCTGCTAAATGTGCTTGAAGTCATAGCACCTTTTAAACACTTTGCAAAATTACGTCAATTTGTAGCAATGAAACTGCCAAAGGGTTTTCCAGTCAAAATAGACATACCCATCCTCCCAACTGTAACTGCAAAAATTACATTTCAAAAATTTGAATTTCGTGATAATATTCCGGATGAGTTGTTTGTGATACCTGAAGACTATGTAGAAGATCCATTACGTTTCCCTGATTTATGA

Protein sequence:

>DPOGS204267-PA
MSELICSSENEVYPLHECVFMGDVRKLSSLLRFNDVTRKDKHGNTALHLAVMLGRKECVQLLLAHGAPVKVKNLAGWSPLAEAISYGDRQTISSLVRKLKQQAREQMEVRRPDLIRALSQIQNFYMELKWDFHSWVPLVSRILPSDVCKIYKSGSGIRLDTTLVDFTDMKWERGDVSFIFKGDKPPSQSLTVLDNKVKVYQHVRYEETENEIEDEVDLLMSSDILAAQMSTKGISFSRAQSGWIFREDRKETVAGLYKSNIYTISGLVLESRKRREHLSTDDLQKNKAIIESLTKGNTQNLDTNGEPVRRASLNPPPESSIDWETYISSSPGEYPGLGRELVYKESSRNFRATIAMSDDFPLSVDMLLNVLEVIAPFKHFAKLRQFVAMKLPKGFPVKIDIPILPTVTAKITFQKFEFRDNIPDELFVIPEDYVEDPLRFPDL-