Monarch geneset OGS2.0

DPOGS203216
TranscriptDPOGS203216-TA1830 bp
ProteinDPOGS203216-PA609 aa
Genomic positionDPSCF300035 + 926084-931374
RNAseq coverage399x (Rank: top 30%)
Annotation
HeliconiusHMEL0064920.084.75% 
BombyxBGIBMGA011506-TA0.073.86% 
DrosophilaCG17754-PA0.056.91% 
EBI UniRef50UniRef50_Q7PXP70.059.80%AGAP001513-PA n=22 Tax=Coelomata RepID=Q7PXP7_ANOGA
NCBI RefSeqXP_001977340.10.056.75%GG18306 [Drosophila erecta]
NCBI nr blastpgi|1948905630.056.75%GG18306 [Drosophila erecta]
NCBI nr blastxgi|3479660660.059.80%AGAP001513-PA [Anopheles gambiae str. PEST]
Group
Gene OntologyGO:00055157.1e-30protein binding
KEGG pathway 
InterPro domain[1-608] IPR0170967.4e-211Kelch-like protein, gigaxonin
[285-604] IPR0159165.1e-81Galactose oxidase, beta-propeller
[169-271] IPR0117056.7e-36BTB/Kelch-associated
[47-164] IPR0113331.5e-33BTB/POZ fold
[61-163] IPR0130697.1e-30BTB/POZ
[67-164] IPR0002104.4e-27BTB/POZ-like
[459-505] IPR0066523.4e-17Kelch repeat type 1
Orthology groupMCL10762 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203216-TA
ATGTCAGCCTGTGAATCTCAAAAGCCGGAGGTATCAAAAGACAATGTATCAACGAACTCAACTGACGAGGGACTGCCTCGAGAATTGAGTCAACTTAGCCTTTCAAGCGGATCAAATTCAGATGAATTCCATTGTAATAAAGGACATTCCGAAACAACAATGAAAAATATTTATGGATACTATCAATCTCAAAAACTTTGTGACGTTGCGTTAATTGCTAGCGGTTGTAGGATACCAGCACACAAAGTTGTATTAGCATCATGCAGTGAATATTTTGCAGCCATGTTCACTGGCTCTCTAAGAGAAGCTCAATCATCAGAAATAACTCTTGAGAGAGTTGACTCACAGGCACTTAGATCACTTGTACACTATTGTTATACAGGCATTATAGAGTTAAGTGAAGATACTGTTGAGATACTCCTGTCAACAGCGAGCTTGTTGCAATTGCATTCAGTTACAAAAGCATGTTGTGATTTTCTTGAAAAGCAATTGGACCCATGTAATTGTTTAGGAATAGCTTTATTTGCTGAACAACAGTCATGTATGGGTTTACATAAAAGTGCTTTGGAATATACATACCAACATTTCATGCAGGTAGTAAAGCAACAGGAATTTTTAACTTTACATGTGGATCAGGTGGCAAATTTACTTAAATGTGATGATCTCAATGTCATGACAGAGGAAAATGTGTTTGAAAGTCTTATGGCCTGGGTGCAGCATGATAATGCTAGTAGAAAACAACACCTGCCAGCATTATTAAAACTAATTAAACTGCCACTTTTATCGTCGGAATATCTTATAGACAAAGTTGAGCAATTATGTGGCGAGGTAACAGAATGTCAACCACTTATAATGGAGGCAGTGAAATGGCATCTTCTGCCTGAAAGGAGGTCGATGTTGTTTTCTCATAGAACAAGACCAAGGAAATCAACGATAGGGAGGCTTCTGGCCATAGGTGGGATGGACGGATACAAGGGTGCCAGCAATATGGAAATGTACGATCCACGAACAAACACTTGGACGCCATTTATGAAGATGGGTGCTAGAAGACTTCAATTTGGTGTTGCCGTGATGCAAAATAAACTTATAGTTGTCGGAGGAAGAGATGGATTAAAAACTTTAAATACAGTGGAATGTTTTGACCTGACGTCTCTCAGTTGGAGTACCCTAGCTCCTATGAACACTCACAGACATGGCTTGGGTGTGGCTGTCCTTGGTGACGGACCCAACTCGCCAATATACGCTGTTGGCGGACACGACGGATGGATATACTTGAATTCAGTGGAAAGATGGGATGCCTGTTCACGTACATGGACAATGGTGTCAGCTATGGCGGGCGCTCGTAGCACGTGTGGCGTAGCGGCGCTACGAGGTCGCCTCTACGCGGTGGGGGGAAGGGACGGTGGTGCCTGTCTGCGATCCGTTGAGTGCTACGATCCAGCTACCAACCACTGGACAAATTGCGCGCCGATGACACACAGACGTGGCGGTGTTAGTGTTTGCGCTGCAGGCGGGTATTTATACGCATTAGGGGGACACGAAGCCCCCGCTAACACTGTGGGTGGAAGACTCGCTTGTGTGGAACGATATGACCCCATCACTGATAGCTGGGTGCTACTAGCAAGGTTGTCTTACGGACGTGACGCTATCGGAAGTTGTCTACTAGGTGACAGAATAGTCGCTGTCGGCGGTTACGACGGTGTGCAGTATCTCTGTGTGGTGGAGGTTTACGATGCGGAGGCCAACACTTGGAAGAAGCTGTCACCATTGAGTACCGGGAGGGCGGGCGCAGCGGTGGTAGCGGTGCCGCCTCCACGCAACCTCTTGTGA

Protein sequence:

>DPOGS203216-PA
MSACESQKPEVSKDNVSTNSTDEGLPRELSQLSLSSGSNSDEFHCNKGHSETTMKNIYGYYQSQKLCDVALIASGCRIPAHKVVLASCSEYFAAMFTGSLREAQSSEITLERVDSQALRSLVHYCYTGIIELSEDTVEILLSTASLLQLHSVTKACCDFLEKQLDPCNCLGIALFAEQQSCMGLHKSALEYTYQHFMQVVKQQEFLTLHVDQVANLLKCDDLNVMTEENVFESLMAWVQHDNASRKQHLPALLKLIKLPLLSSEYLIDKVEQLCGEVTECQPLIMEAVKWHLLPERRSMLFSHRTRPRKSTIGRLLAIGGMDGYKGASNMEMYDPRTNTWTPFMKMGARRLQFGVAVMQNKLIVVGGRDGLKTLNTVECFDLTSLSWSTLAPMNTHRHGLGVAVLGDGPNSPIYAVGGHDGWIYLNSVERWDACSRTWTMVSAMAGARSTCGVAALRGRLYAVGGRDGGACLRSVECYDPATNHWTNCAPMTHRRGGVSVCAAGGYLYALGGHEAPANTVGGRLACVERYDPITDSWVLLARLSYGRDAIGSCLLGDRIVAVGGYDGVQYLCVVEVYDAEANTWKKLSPLSTGRAGAAVVAVPPPRNLL-