Monarch geneset OGS2.0

DPOGS213717
Transcript: DPOGS213717-TA (1059 bp)
Protein: DPOGS213717-PA (352 aa)
Genomic position: DPSCF300310 - 166053-168901
RNAseq coverage: 263x (Rank: top 41%)
Annotation
Heliconius       HMEL017665        1e-112  67.91%
Bombyx           BGIBMGA011848-TA  2e-114  75.42%
Drosophila       CG9018-PA         2e-91   49.60%
EBI UniRef50     UniRef50_Q5TRR3   2e-97   54.62%  AGAP005331-PA n=2 Tax=Diptera RepID=Q5TRR3_ANOGA
NCBI RefSeq      XP_001599449.1    4e-105  58.66%  PREDICTED: similar to conserved hypothetical protein [Nasonia vitripennis]
NCBI nr blastp   gi|312372605      8e-103  55.87%  hypothetical protein AND_19932 [Anopheles darlingi]
NCBI nr blastx   gi|332028922      6e-107  57.51%  Regulation of nuclear pre-mRNA domain-containing protein 1B [Acromyrmex echinatior]
Group
KEGG pathway 
InterPro domain  [8-131]   IPR006569  3.2e-42  RNA polymerase II, large subunit, CTD
                 [2-130]   IPR008942  4.1e-22  ENTH/VHS
                 [57-119]  IPR006903  1.7e-21  Domain of unknown function DUF618
Orthology group  MCL12120  Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213717-TA
ATGTCCGGTTTTACCGAAAATGCGTTGATTCGAAAGTTACAGGAACTTAATTCAAGTCAACAAAGTATACAAACATTATCACTATGGCTAATACATCACCGTAAACATCATGCCGCTATTGTTAAAACATGGTATAAGGAACTTCTGAAAGCTAAAGACAGCAAACAAGTAACGTTCATGTACCTAGCTAACGATGTTATACAAAATAGTAAGAAGAAAGGTCCGGAATATGGCAAAGAGTTTGGGGCCGTGTTAATAGATGCACTGAAGCACATGGCAAAGACTGGTATCAATGCAAAAACGAAACATGCACTACATAGAATATTAAATATATGGGAGGAGAGGGGTGTGTATGAACCGGAAAGAATACAGGAATTCAAGGTTGCTGTTGATCCAAATGATGCTGAAGTTACAAATGCTAAAAGAAAGGCTGTTGATATAGAAACAAAAGACAATGTAAAAAAATCTAGGCAGGAGGAGAAGACGAGGCAAAAAGAACATAAAGAAAGAAGAAAAAGTGATTCTAAACTTGAATCAAAGGCTGGTTCTGATGGACATAATGACAATCATTCAAGCAGCCCTAAGACACCACCAGGCGATCCGCCGGAACCTGAAGAACTTATCAAGGCCCTACTGGAACTTGAATCCAGTGCTTCAAGTGATGAAGCTGTCCGAGAACGTATCGCCTCCCTCCCCCCAGAGGTTTCCGAAGTGCAATTATTATCTAAGTTAGAAGATAAAGAATCAGCTCTAAGTCTTAGCGCTGTAGTTAATTCAGCAGTAGAATTACTGGCCGAGTATAATTTAAGACTATCCGAAGAATTGGAGAAACGGCGAAAGGTGGCAACGATGTTAAGGGACTTTGAACAAGCACAGAGGGAACTTGCAAAGAAAGCTGAGGCTACCTTAGAGGAGTACAACATAAAACTTCAAAAAATCTACGAAGTGAAAGCCGAGGTGAAGTCTCATATAGAGAATTTGCCAGATGTTTCCCGCCTGCCTGATGTTACAGGAGGTTTGGCGCCCTTACCCTCCGCTGGGGACCTTTTCAGTGTTTGA

Protein sequence:

>DPOGS213717-PA
MSGFTENALIRKLQELNSSQQSIQTLSLWLIHHRKHHAAIVKTWYKELLKAKDSKQVTFMYLANDVIQNSKKKGPEYGKEFGAVLIDALKHMAKTGINAKTKHALHRILNIWEERGVYEPERIQEFKVAVDPNDAEVTNAKRKAVDIETKDNVKKSRQEEKTRQKEHKERRKSDSKLESKAGSDGHNDNHSSSPKTPPGDPPEPEELIKALLELESSASSDEAVRERIASLPPEVSEVQLLSKLEDKESALSLSAVVNSAVELLAEYNLRLSEELEKRRKVATMLRDFEQAQRELAKKAEATLEEYNIKLQKIYEVKAEVKSHIENLPDVSRLPDVTGGLAPLPSAGDLFSV-