Monarch geneset OGS2.0

DPOGS206972
TranscriptDPOGS206972-TA1479 bp
ProteinDPOGS206972-PA492 aa
Genomic positionDPSCF300001 + 216757-220794
RNAseq coverage409x (Rank: top 30%)
Annotation
HeliconiusHMEL0143600.085.25% 
Bombyx% 
DrosophilaCG18005-PA4e-13051.78% 
EBI UniRef50UniRef50_B0W6461e-15458.76%Red protein n=3 Tax=Endopterygota RepID=B0W646_CULQU
NCBI RefSeqXP_001844180.12e-15558.76%red protein [Culex quinquefasciatus]
NCBI nr blastpgi|1700326245e-15458.76%red protein [Culex quinquefasciatus]
NCBI nr blastxgi|1571238401e-16060.47%red protein (ik factor) (cytokine ik) [Aedes aegypti]
Group
Gene OntologyGO:00056348.2e-79nucleus
KEGG pathway 
InterPro domain[64-299] IPR0129168.2e-79RED-like, N-terminal
[376-492] IPR0124923.1e-53RED-like, C-terminal
Orthology groupMCL15305 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206972-TA
ATGGAAGACAGAAGTGTTGAAGACACTCCGGTGTCTCAGAGGCTTACTAACGATGACTTCAGAAAGTTGTTGATGACTCCTCGATCCACTCCATCTGGTGGCAGTCGGGGGGATGGATCTATTCGGGAAGCCATGGCTAGTGCTGGGACCATGCCTCCACCGGTAGAAAACAAGAGTGAACTGAGACGTAAAAAGAAATCTTACTATGCCGCTCTGAAGAAGCAGGAAGACAATAAGCTTGCAGAGTTGGCAGAGAAGTATAGAGATCGTGCAAGGGAAAGACGTGACGGTGTGAACGATTTGGTCGTTGACCCCACAAGCAACACTAGCAGCGCTTACAGGGCCGTAGCTCCGGATGCTAAGTCCGGAATGGATGCTAAGGAACGGCGACGACAAATGATCCAGGAATCCAAATATCTTGGTGGTGATATGGAGCACACCCACTTGGTGAAGGGTCTTGATTATGCTCTGCTGCAGAAGGTCCGTTCTGAAATCCAAACGCGGGAAAACGAGCAAGAATTGGAAATGGAACGACTTGTATCAGTTCCCATTGAAGCGATACCAGAGAAGAAAGAAGTGTTGCCAGTGGAAGAAGAAATACAGTTCGAAAACCCAATGGCTAGAAATATTTACAATCTCATTATAGAGCAGAAGAACAAGAAGATTACTCGCAACGACCTGTTCGCTCCAGGTCGTATGGCTTATGTTGTTGAATTAGATGATGAAGGGACCATAGACAGCGATATACCAACAACCCTGACGAGGAGCAAGGCGGATGTTCCGGAAATGGATGAACGGACATCAGGCTCCGCCTCCAACGACGTGGTCATCGAGAAGCTGTCTCAGATATTCTCGTATTTGAGACACGGGCGGCACAGGAAGCTTAAGAAGACGAAGGACAAAAATACAGAGAAGGGTCGCAACGACGACTCTATATATGGCGATATTGGGGATTATGTAGCGGATGACAGGCAAAGTGACCGACGCGATGAACGACCGAGGACTGGATACTTTGACAAACCACAGGAGACAGAGAAGGAAGAAGGGCCACTTGTAGGTCGTCGTACGGAGCGCGACCGTCGCTCGGCGGCGCTCCTGTCCCGGTTGGCAGCGGAGCCCGAGGGCTACGCTGAGTGTTACCCCGGGCTGAGGGAAATGAATGACGCCATTGACGATTCAGACGACGAGGTCGACTACACCAAGATGGACGCCGGAAATAAGAAGGGACCTATCGGTCGTTGGGATTTCGACACTCAAGAGGAATATTCTGACTACATGAGTAGCAAAGAAGCGCTTCCAAAGGCTGCTTTCCAATATGGCGTCAAAACACAAGATGGCCGTAAGACGAGGAAGACGAAGGATAAGAGCGAAAAAGCCGAACTCGACAGGGAATGGCAACAGATACAAAATATTATACAGAAGCGTAAAACACCACAGTATCCTGGGGATGAGAGTAACTTTAAAAAACCTAGATATTAA

Protein sequence:

>DPOGS206972-PA
MEDRSVEDTPVSQRLTNDDFRKLLMTPRSTPSGGSRGDGSIREAMASAGTMPPPVENKSELRRKKKSYYAALKKQEDNKLAELAEKYRDRARERRDGVNDLVVDPTSNTSSAYRAVAPDAKSGMDAKERRRQMIQESKYLGGDMEHTHLVKGLDYALLQKVRSEIQTRENEQELEMERLVSVPIEAIPEKKEVLPVEEEIQFENPMARNIYNLIIEQKNKKITRNDLFAPGRMAYVVELDDEGTIDSDIPTTLTRSKADVPEMDERTSGSASNDVVIEKLSQIFSYLRHGRHRKLKKTKDKNTEKGRNDDSIYGDIGDYVADDRQSDRRDERPRTGYFDKPQETEKEEGPLVGRRTERDRRSAALLSRLAAEPEGYAECYPGLREMNDAIDDSDDEVDYTKMDAGNKKGPIGRWDFDTQEEYSDYMSSKEALPKAAFQYGVKTQDGRKTRKTKDKSEKAELDREWQQIQNIIQKRKTPQYPGDESNFKKPRY-