Monarch geneset OGS2.0

DPOGS201250
TranscriptDPOGS201250-TA2460 bp
ProteinDPOGS201250-PA819 aa
Genomic positionDPSCF300037 + 143920-146379
RNAseq coverage396x (Rank: top 30%)
Annotation
HeliconiusHMEL0027720.090.37% 
BombyxBGIBMGA014340-TA0.087.59% 
DrosophilaCG11526-PB0.057.90% 
EBI UniRef50UniRef50_Q8IRD50.057.90%AT25231p n=29 Tax=Metazoa RepID=Q8IRD5_DROME
NCBI RefSeqXP_001815164.10.065.09%PREDICTED: similar to CG11526 CG11526-PA [Tribolium castaneum]
NCBI nr blastpgi|1892339490.065.09%PREDICTED: similar to CG11526 CG11526-PA [Tribolium castaneum]
NCBI nr blastxgi|1892339490.065.09%PREDICTED: similar to CG11526 CG11526-PA [Tribolium castaneum]
Group
KEGG pathway 
InterPro domain[401-797] IPR0218195.7e-128Protein of unknown function DUF3402
[33-312] IPR0124861.6e-72N1221-like
Orthology groupMCL11363 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201250-TA
ATGGGCACTAACGGGAACGGTCGTCGTGGCGGCACGAAGCTGCCTGACGTTCGTCGTCCCATGGAAGAAGGAGACTGTGTGAACGACATGGAAAGTCCTGAATTGGATTTCGTTTATGATGATTGTGATACGCAAGCTAATGAAATTGCAGAACTCTATAGTTACACTGAACACCCTGAATTTCAACTGAACGTTAAAGCTTTTGAGGACATTATGGAGGAATTTAACTTGCCTCCTAGTTGGCAAAGACTTTCGAACCAACAACAACGCACTGTAGTGATGAAGCTACTTGAGCATCTTGATGTTGCAGTTACAGAGATACGAATGCGAGCGACGAGATGTATACTCTACCTAGCACAAGGCTGCTGGGCCGAAATGCAGTCCGATGCCGAACAGGCGCATTGGGCACGTATTAATATTATGACACTCTATGAAATGGGGACATTTGCTGCATTCGTTGAACTTTTGAATTTGGAAATAGTAAGAAGTAACGCAGCTGTAGCATCTAGAAAGCTAGCAGAGTCATTAGTAGATTCAACCAACTTGAGAGTGATACTGTCAGTATTATACCTCATTACGGAACACATGCGAACTGAAAAAGATAATCCAGATTCCGAGTATTCAGATCTAGTTAAAAACTTCAAAGAAGAAATAGGAACACCTTTTAATGAAGAATTACTGCCGGTGAAGCTCTTGGGTATGGTAACACAGCTATGTGTTGGAACTACACCCCATTTTCCTATGAAAAAAGTGCTTTTACTGCTCTGGAAGATATTACTAGTGTTTCTTGGTGGTTCTAAAGAATTGAAAGAACGTAAAGCTAGATTGCGTGAAAAGTATGGTTTACCACCTTGTACTGAAGATACCCTTGAAATAATAAAGGGCATGCGTTCAAGTTCTCCACCACCGAGTGCAGCTGACATGTTGGAAAACCAGAATCCTGGGCAGAGGAAGTTAAAAGAGAATGAAAGAGCTCTTAGGCGACAGACATTTATGAAACAAACTTCTCTTGATGAATCTGATGAACAAATCTTCGTAGACAAAGAGGAAACCAGTAACGGATCTGAAGATTACTCTATGGAATTTCAGTCAATGTCAGCTGAGGGAGTTCAGAATGCCAACACAAACCAGAACTGCCCATACTACATGTATATAAAAAGATTAGACAGTCCCCCACCCCCTCCACCACTCCCTAGAAGCCTGCCCTGGAGATCAAAAGTCCGACAAAAAGACATAGACACATTTCTTGATAATGTAAGAGTGAAATTTGTAGGTTACTCGCTGCCTGGTGATAGACAAACCATTGCTGGCCTTCCACAACCTATACATGAAGGAATCGAGATATTAAAAAAGCATACATACACAAGCCTCGCTGAGGTTCAGGCCGAGAGGGAGCTGGAGATTATCCGCAGCCCTCTCACGAAAGGCGAAAAAGAAGTTGAGGAAACTGAAGCGGAGATATTATATAGAGCTGTACAGCCAAACCTTCCACAGTATATAATCGCTCTACTTAAAATTCTCTTAGCCGCCACACCCACGTCAACGGCGAAAACATACTCCATGAATATAATGGCTGACGTGTTGCCAGAAGAGATGCCAATGACCGTGCTACAATCACTCAAACTGGGTATAGATGTTAATAGACATAAAGAGATCATAGTTAAAGCCGTGACCGCCATCCTGCTGCTTCTGTTGAAACATTTCAAATTAAATCATATATATCAATTTGAATTCATGTCACAGAATCTAGTATTCTCTAACTGCATGCCTCTCGTTTTAAAATTCTTCAACCAGAACATCTTATCGTATATAGGAGCTAAGAATTCAATACCCATATTTGATTTCCCCGCTTGTGTTATCGGCGAGCAACCGGAATTGACGAGAGATTGTTTGGACATCGGAGACTCGTCAGTACCATACTCGTGGAGGAACGTGTTCTCATGCATAAACTTGTTGCGAATTCTCAACAAACTCACTAAGTGGAAGAACGCTAGAATCATGATGCTTGTCGTATTTAAGAGCGCTCCGATACTAAAACGTACACTCAAAGTCCGTCACGCTCTCATGCAGTTCTATGTATTGAAGCTGCTCAAAATGCAAACGAGATATTTGGGGCGACAGTGGAGGAAGACGAATATGAAGACTATTAGTGCCATTTACTCCAAAGTCCGGCATCGGTTGAACGATGACTGGGCGTTCGGTAACGAGGTCGATGCTCGGCCATGGGATTTCCAGGACGAAGAATGTGCATTGAGAGTAAGCGTAGAGAGATTCAATCAGAGACGGTATGGAAACGCCAGTGAACTGGAAACTGAACTCACCCCGGTGGACACAGATATAAATAGTGTTCTCGATAGTAATATAGAGCTGGACGAGGAATTCAAGTCTAACTATGAGTTGTGGCTGGAGCAAGAGGTGTATAACAATGAAATCAACTGGGACGTTCTGCTCTCCACATAA

Protein sequence:

>DPOGS201250-PA
MGTNGNGRRGGTKLPDVRRPMEEGDCVNDMESPELDFVYDDCDTQANEIAELYSYTEHPEFQLNVKAFEDIMEEFNLPPSWQRLSNQQQRTVVMKLLEHLDVAVTEIRMRATRCILYLAQGCWAEMQSDAEQAHWARINIMTLYEMGTFAAFVELLNLEIVRSNAAVASRKLAESLVDSTNLRVILSVLYLITEHMRTEKDNPDSEYSDLVKNFKEEIGTPFNEELLPVKLLGMVTQLCVGTTPHFPMKKVLLLLWKILLVFLGGSKELKERKARLREKYGLPPCTEDTLEIIKGMRSSSPPPSAADMLENQNPGQRKLKENERALRRQTFMKQTSLDESDEQIFVDKEETSNGSEDYSMEFQSMSAEGVQNANTNQNCPYYMYIKRLDSPPPPPPLPRSLPWRSKVRQKDIDTFLDNVRVKFVGYSLPGDRQTIAGLPQPIHEGIEILKKHTYTSLAEVQAERELEIIRSPLTKGEKEVEETEAEILYRAVQPNLPQYIIALLKILLAATPTSTAKTYSMNIMADVLPEEMPMTVLQSLKLGIDVNRHKEIIVKAVTAILLLLLKHFKLNHIYQFEFMSQNLVFSNCMPLVLKFFNQNILSYIGAKNSIPIFDFPACVIGEQPELTRDCLDIGDSSVPYSWRNVFSCINLLRILNKLTKWKNARIMMLVVFKSAPILKRTLKVRHALMQFYVLKLLKMQTRYLGRQWRKTNMKTISAIYSKVRHRLNDDWAFGNEVDARPWDFQDEECALRVSVERFNQRRYGNASELETELTPVDTDINSVLDSNIELDEEFKSNYELWLEQEVYNNEINWDVLLST-