Monarch geneset OGS2.0

DPOGS207120
TranscriptDPOGS207120-TA1560 bp
ProteinDPOGS207120-PA519 aa
Genomic positionDPSCF300001 + 3401629-3412939
RNAseq coverage1056x (Rank: top 12%)
Annotation
HeliconiusHMEL0096030.068.74% 
BombyxBGIBMGA012489-TA2e-13447.72% 
Drosophilacoro-PB0.074.29% 
EBI UniRef50UniRef50_Q7JVY00.074.29%Coro, isoform A n=24 Tax=Bilateria RepID=Q7JVY0_DROME
NCBI RefSeqXP_002004238.10.075.35%GI19814 [Drosophila mojavensis]
NCBI nr blastpgi|1951194380.075.35%GI19814 [Drosophila mojavensis]
NCBI nr blastxgi|1951194380.074.90%GI19814 [Drosophila mojavensis]
Group
Gene OntologyGO:00055158.8e-37protein binding
KEGG pathway 
InterPro domain[1-494] IPR0155055.6e-285Coronin
[255-390] IPR0150492.1e-54Domain of unknown function DUF1900
[77-297] IPR0159438.8e-37WD40/YVTN repeat-like-containing domain
[4-68] IPR0150481.6e-34Domain of unknown function DUF1899
[78-109] IPR0197811.3e-08WD40 repeat, subgroup
[63-110] IPR0016807.3e-07WD40 repeat
Orthology groupMCL10445 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207120-TA
ATGTCGTTCAGAGTGGTTCGTAGTTCAAAATTCCGCCATGTATATGGTCAGGCCCTTAAAAGGGAACAGTGTTATGATAACATCAGAGTATCGAAAAGTTCGTGGGACTCTACGTTTTGTGCTGTAAACCCCACTTTCCTGGCCATCATTGTAGAGTCAGCCGGCGGCGGAGCCTTCATAGTTTTACCGCATAATAAGGTCGGCCGCATACCAGCAGATCATCCTCTAGTGGGAGGACACAAGGGTCCAGTGTTGGACATAGCGTGGTGCCCCCACAACGACAACGTCATCGCCAGCGGCTCCGAGGACTGCGTTGTTAAGGTATGGCAAATACCGGACGGTGGACTGACCCGTACGTTGACGGAGCCTGTGGTGGATCTCGTGTATCATCAGCGACGCGTGGGCTTGGTGCTATGGCATCCAACCGCTCAGAACGTGTTGCTCACTGCTGGCTCCGATAACCAGATAGCGATCTGGAATGTTGGCACTGGCGAGGTCTTGCTGAGTCTGGACTGTCACCCTGACCTCATCTACTCCGCTTGCTGGAACTGGACGGGCTCCAAGCTGCTCACCACCTGCCGTGACAAGAAGATTAGGATAATAGATCCTCGCAAGGGAGAAGTGGAATCAGAGGCCATAGCTCACGAAGGCAGCAAAGCGTCCAGAGCAATCTTTTTAAAGCATGGACTGGTGTTTACCACTGGATTCAGTCGCATGTCCGAGCGTCAGTACACTCTCCGTACACCGGACGCCCTCGGAGAACCTATCGTGACGGTGGAGATTGACACAAGTAACGGAGTCATGTTCCCACTCTACGATCCCGACACCAATCTCATCTACCTCTGCGGGAAGGGCGATTCGGTCATCAGATATTTTGAGGTCACCCCAGAGCCGCCTTTCGTCCACTACATTAACACCTTCCAAACACCGGACCCACAGAGAGGTATTGGTATGATGCCCAAGCGCGGCTGTGACGTAGCTACGTGCGAAATAGCGAAGTTTTACAGACTTAACAACTCTGGTCTCTGTCAGGTGGTTTCGATGACCGTGCCGCGTAAGTCTGAGTTGTTCCAAGAGGACTTGTACCCTGACACATTGTCCGATGAAGCTTCATTGACGGCCGACGAGTGGCTCGCGGGTGAAGACGCCGAACCCTGCACCATGTCGCTGAAGGAGCGTGCGCTTATTGTACAGGGTGGTTACGTAGCGGGAAGGGCGCACAACCTCACCGTGACCAAGAGGAACGCGCTGGCGACCGCCAGGGATAAGGAAAAGGAGAAGGAGAAGGAAAAGGATAAGGAGCCCGAGAGGAGCCCCACCCCGGGCCAGAGGGACACACCCGCCACGCCGGCCGCCACCCCACCGCCAGCCTTCACCGCTATGGTGGAGAAACAACTATCGGACCTGGTGGAAGAGATCCGTAAGCTGAAATCGGTTATAGTGAAGCAAGAGAACCGTATACGGGCACTAGAGGCTACGGTTAAGGGACAAGTGGCTGCAGCCACACCAGTACCCGCTGATCACAACCACGACGACAACATGGCGCCCGACGAGGTCTGA

Protein sequence:

>DPOGS207120-PA
MSFRVVRSSKFRHVYGQALKREQCYDNIRVSKSSWDSTFCAVNPTFLAIIVESAGGGAFIVLPHNKVGRIPADHPLVGGHKGPVLDIAWCPHNDNVIASGSEDCVVKVWQIPDGGLTRTLTEPVVDLVYHQRRVGLVLWHPTAQNVLLTAGSDNQIAIWNVGTGEVLLSLDCHPDLIYSACWNWTGSKLLTTCRDKKIRIIDPRKGEVESEAIAHEGSKASRAIFLKHGLVFTTGFSRMSERQYTLRTPDALGEPIVTVEIDTSNGVMFPLYDPDTNLIYLCGKGDSVIRYFEVTPEPPFVHYINTFQTPDPQRGIGMMPKRGCDVATCEIAKFYRLNNSGLCQVVSMTVPRKSELFQEDLYPDTLSDEASLTADEWLAGEDAEPCTMSLKERALIVQGGYVAGRAHNLTVTKRNALATARDKEKEKEKEKDKEPERSPTPGQRDTPATPAATPPPAFTAMVEKQLSDLVEEIRKLKSVIVKQENRIRALEATVKGQVAAATPVPADHNHDDNMAPDEV-