Monarch geneset OGS2.0

DPOGS207363
TranscriptDPOGS207363-TA1272 bp
ProteinDPOGS207363-PA423 aa
Genomic positionDPSCF300188 + 454814-459072
RNAseq coverage127x (Rank: top 57%)
Annotation
HeliconiusHMEL0088690.091.02% 
BombyxBGIBMGA013761-TA2e-1634.39% 
DrosophilaCG4221-PA7e-10150.13% 
EBI UniRef50UniRef50_Q7QKJ78e-10351.60%AGAP003285-PA n=4 Tax=Pancrustacea RepID=Q7QKJ7_ANOGA
NCBI RefSeqXP_307793.41e-10351.60%AGAP003285-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|3123723081e-10351.86%hypothetical protein AND_20333 [Anopheles darlingi]
NCBI nr blastxgi|3123723083e-10551.47%hypothetical protein AND_20333 [Anopheles darlingi]
Group
KEGG pathway 
InterPro domain[59-90] IPR0223648.2e-08F-box domain, Skp2-like
Orthology groupMCL14717 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207363-TA
ATGTCAAGCGGGAGTTGCTCGGGGCCGGGTGTGTGTGGTGGGGCGAGACGTAGACCTCCCCCTCCTCCCGAGCCTGCACTGTCCTTGGCTGCAGACCTGAGTGAGCTGAGCCTCGACCAGGGCTACCACACCCTGGCTGACTGCGGCCCCCCTAGGAGGAGGAGGGATAACACGCTCACTGACGCTCTGTGGCTGAAGATACTGTCCTACCTTGAGGTGTCAGATTTATGCCGAATGTCCCGGGTGTCCAGACGATGGTCCCGTTTCGTGGCCAGGCTGACTGCCAGACCTGAACCCTGGAGGCGGGTACGGGCTTCTGGTCCCCTGGAGGTGGCTGCAAGGTGCGCGGGTGCAAGAGCTGGACCCTGTGTTGGTGCTGTTAGGGAATGGAGGTCCAAGACCTGTTCTGTGACGGTAGCCGGAGCACAGTTATTAGCGGCGACCTTCAGGAACTTAACTCATTTAGCTTTAACGAACTCAAATACAGTGGACGCAAGAGCTTTGGCCCCAATTATCACAGATTTGGTTGACCTGCGCCATGTTGATTTAACAGGCTGTCCAAACATGGACTGGCCAGAGTGGAATTGGCTGGAAAGTCGTCTTACAAACAGACGGCCCCCCATAGAGTATATAGACCTCACTGACTGCACCGCTGTCACCGACGCCGGCCTCTGCGCTCTGCTCCACACCTGCCCTTCATTACAGTACCTTTATCTCAGGAGATGTACTCTAGTCACTGATGCAGGAGTCCGATGGATACCTTCATATTGCGCTCTCAAAGAACTGAGCGTGTCTGATTGTACGGGGGTCACGGACTTCGGTTTATACGAACTGGCGAAATTGGGGCCAGCGTTACGATACCTATCCGTTGCTAAATGTTCTCAGGTATCAGATTCAGGGGTCCGAACTCTCGCACGTCGCTGTTATAAGCTACGTTATTTGAACGCCCGTGGCTGTGGGGCGCTGGGGGATGATGGGGCCGAGGCCATCGCCAGGGGCTGCTCCAGATTAAGAGCTCTGGACCTTGGTGCCACTGATGTATCGGAGGCCGGGCTGCAGATTCTTGCCAGATGCTGTCCGAATCTTAAGAAGCTGGCGTTGAGAGGCTGTGAACTCATCGGCGACGATGGTTTGGAAGCCGTTGCGTATTATTGTAGAGGTTTGACACAGTTAAACATTCAGGACACGCCGGTTACATTAAGAGGATACAGAGCTGTTAAGAAGTACTGCAAGAGATGTGTCATAGAACATACAAATCCAGGATTCTGTTGA

Protein sequence:

>DPOGS207363-PA
MSSGSCSGPGVCGGARRRPPPPPEPALSLAADLSELSLDQGYHTLADCGPPRRRRDNTLTDALWLKILSYLEVSDLCRMSRVSRRWSRFVARLTARPEPWRRVRASGPLEVAARCAGARAGPCVGAVREWRSKTCSVTVAGAQLLAATFRNLTHLALTNSNTVDARALAPIITDLVDLRHVDLTGCPNMDWPEWNWLESRLTNRRPPIEYIDLTDCTAVTDAGLCALLHTCPSLQYLYLRRCTLVTDAGVRWIPSYCALKELSVSDCTGVTDFGLYELAKLGPALRYLSVAKCSQVSDSGVRTLARRCYKLRYLNARGCGALGDDGAEAIARGCSRLRALDLGATDVSEAGLQILARCCPNLKKLALRGCELIGDDGLEAVAYYCRGLTQLNIQDTPVTLRGYRAVKKYCKRCVIEHTNPGFC-