Monarch geneset OGS2.0

DPOGS211400
TranscriptDPOGS211400-TA999 bp
ProteinDPOGS211400-PA332 aa
Genomic positionDPSCF300115 - 350985-355341
RNAseq coverage585x (Rank: top 22%)
Annotation
HeliconiusHMEL0080833e-12568.17% 
BombyxBGIBMGA010858-TA1e-8680.00% 
DrosophilaCG6486-PB1e-7144.32% 
EBI UniRef50UniRef50_Q9VSN72e-6944.32%CG6486, isoform A n=15 Tax=Drosophila RepID=Q9VSN7_DROME
NCBI RefSeqXP_969864.11e-9554.01%PREDICTED: similar to AGAP006264-PA [Tribolium castaneum]
NCBI nr blastpgi|910907822e-9454.01%PREDICTED: similar to AGAP006264-PA [Tribolium castaneum]
NCBI nr blastxgi|910907824e-9654.01%PREDICTED: similar to AGAP006264-PA [Tribolium castaneum]
Group
Gene OntologyGO:00055152e-46protein binding
KEGG pathwaytca:6583773e-95 
 K13341 (PEX7, PTS2R)maps-> Peroxisome
InterPro domain[12-331] IPR0110462e-46WD40 repeat-like-containing domain
[12-327] IPR0159434.5e-46WD40/YVTN repeat-like-containing domain
[133-173] IPR0016806.4e-09WD40 repeat
[96-130] IPR0197811.5e-08WD40 repeat, subgroup
Orthology groupMCL17407 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211400-TA
ATGCCAACCTTCCTGACACCTGGCCGGCATGGCTACAGTGTAAGATTTTCTCGAACGAACCCCGATACCCTGGCAGTAGCCACAAGCCAGTATTATGGTTTAGCTGGTGGTGGAACACTGTTCTTTTTGGAACTAACACAAGACGGAAGCAATCTCGTTGAGTTACAGAAGATTGAATGGAGCGATGGCCTCTTTGATGTGTCGTGGTCGGGGTGCACGGAGGGCTTCGCTTCGTGCGGGGCGGGCGACGGAGCGGTGCTCGTTTGCCGGGCCGGGTGCTCCGCCCCCCTCAGAGTGTTGAGAGGCCACCGAGGGGAGGTGTGCTCCGTGGATTGGCCCGGGAGACAGCTCTTGAGCGCCAGCTGGGACACTACCGTCAAATTGTGGGACCCCGAGTCCGAGGCGTGTATCAGTACGTTCTCGGGTCACTCTCAGCTGGTGTACTCGGCGTCCTTCTCCCCTCACTCCCCGGGGACCTTCGCTTCCGTGTCCGGGGACGGTCACCTCAAGCTGTGGTCGTGTTCGGAGCAACGCCCTATAGCCGTCATCAAAGCACACGATGCTGAGGTCTACCACAAAGACCATGAGGGACCAACTAAACTGTACAGTATCCCTGAGTCTCCTGTTTCCACTTCCTGGCTGTCGTGTGACTGGAGCGGGGCGGAGAGTCGTCTGGTGGCGAGCGCCGGCTCCGACGGGTTGGTGAAGGGCTGGGACCTCCGGAGCCTCGCCGCTCCCGTCTTCACACTCAGAGGTTGTGAACGCGCCGTTCGTCGCGTGCAGTTTTGTCCGCACGCGCCGGCCGTGCTCGCCGCCGTCTCCTACGACTTCACCACCAGGATCTGGGACCTAAAGCTGGGTTGGTCTCCGTTGGAGACTATCCGTCACAGGTCGGAGTTCACGTTCGGTCTAGACTGGAGCGCGCTCCGGCCTCGCTCGCTGGCAGACTGCGGCTGGGACTCCCTGGTGCACGTGTTCGTCCCTAGAACACTCATGTGA

Protein sequence:

>DPOGS211400-PA
MPTFLTPGRHGYSVRFSRTNPDTLAVATSQYYGLAGGGTLFFLELTQDGSNLVELQKIEWSDGLFDVSWSGCTEGFASCGAGDGAVLVCRAGCSAPLRVLRGHRGEVCSVDWPGRQLLSASWDTTVKLWDPESEACISTFSGHSQLVYSASFSPHSPGTFASVSGDGHLKLWSCSEQRPIAVIKAHDAEVYHKDHEGPTKLYSIPESPVSTSWLSCDWSGAESRLVASAGSDGLVKGWDLRSLAAPVFTLRGCERAVRRVQFCPHAPAVLAAVSYDFTTRIWDLKLGWSPLETIRHRSEFTFGLDWSALRPRSLADCGWDSLVHVFVPRTLM-