Monarch geneset OGS2.0

DPOGS206835
Transcript: DPOGS206835-TA (1509 bp)
Protein: DPOGS206835-PA (502 aa)
Genomic position: DPSCF300001 - 3328780-3336009
RNAseq coverage: 162x (Rank: top 52%)
Annotation
Heliconius: HMEL009611, 1e-86, 44.52%
Bombyx: BGIBMGA012782-TA, 6e-66, 49.04%
Drosophila: (no entry)
EBI UniRef50: UniRef50_Q28C36, 2e-27, 49.22%, Novel protein n=8 Tax=Tetrapoda RepID=Q28C36_XENTR
NCBI RefSeq: XP_001943669.1, 3e-25, 38.10%, PREDICTED: similar to MGC80116 protein [Acyrthosiphon pisum]
NCBI nr blastp: gi|147905806, 2e-27, 50.00%, TRAF-type zinc finger domain-containing protein 1 [Xenopus laevis]
NCBI nr blastx: gi|147905806, 2e-28, 50.00%, TRAF-type zinc finger domain-containing protein 1 [Xenopus laevis]
Group
KEGG pathway: tet:TTHERM_00431210, 2e-21
 K12842 (SR140) maps -> Spliceosome
Orthology group: MCL22308, Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206835-TA
ATGGAAGAAGCGGAGAATAAGGTTTGCCATAACTGCAAGCGTGAGATACCTCTCGCTAACTTCACGATTCACGCGGTGCATTGCGCTCGCAATATCCGACTTTGTCCTGTTTGCAAGGAGCCGGTGCCTGTGCAGGACTTGCAGCAACATCATGACGACCAACACAAGCTTGTACCATGCAAGCAGTGCGGCGAGGACGTATGCGGTACCGATATGGAGGATCACGTGAGGGACTCGTGCGCACTCACCATGCAGACTTGCAGATTCTGTACCCTAGAGCTGCGTCGCCGCGAGCTGCCCGCCCACGAGCGGTACTGCGGCGCGCGCACCGAGCTGTGCGAGTGTGGCGAGTGGGTCATGATGAAGTATAGGCAGCTGCACATCGACTCCAACCACGGGTTTATCAGACTCGATGATGACCCAGTACCAATACAAAGTATTAAGGCGTCAGTGTCGAAGCAGAAGGTTACAACAGATAAAACACCTCTGTCAAACGGACTGAATAATTTAAATCGTTCAGCATTCCCACGGAAGAACAATTCAAGAGACTTGAACACTGCTGCTGATCAAACAGCCTCCACAAGTTCACAAAAACCCACCGCTATGGCCAGTACCAGTAAGGGGAACAGTTCAAAGACGAAAGAGGAAAATATTACGAACGCCAACGGAGCTATTCCCAGGGCGAAAAGAATAGAGAGACAGCATCCAGGGAATGATAGTAAAGATAAAGTAAAGAACTCTAGTTCCCGTGGTTCCATGAAGAAGCGGCCTGCTCCTCCTCCCCCCGCTTCCTCGGCGCCCGTGCTGCAGGCAGCACTCCAGCGACAGCAGCGAGAGGAAACGCAGCGGCAACTTCAGAACCAGGAGAATTTGGCAAGGGGTCTGCCGCCGATTCTGAACCCAGCTGAGAAATTAGAGAGGCTTCGGAAAATGGACGCTCTTCACCAAAGGGAACCAGACGACCAGTCATGGAAGAACCGGCTGCAAGGAAGAGTGTGGTCGAGGCCACATGTCTACCCCATCGGTCACGTCCTTAGTGGACAGACGGATGTGGCCGGTGAAGTGAGGAAGGAACTAAAAAACTTAAAGCCGATGACTCCCGAGGAGTTCAATGACAGATATAGAAACATTCAAAGTGAAAGACAGGACCGATTCAAAGAGATAAAGACCTCTTTAAGGGAACTAAGGAGAGGACTCAATGAGGTCATAGCGCCGTACAATTCAAACTCAAACGCAGACACCAATCACAGTCGTCATGAGGAGGAGGCTCCATGCGAGTTCTGCGGCACCAGCGTCCTGCTGGAAGATCTGGTCCTGCATCAGACCGGTTGCCGGCCGGACCTGGCCCAGTACCGCAGTCCGCCCCCCTCCCCTGGCCCTCGCCCCCCCGCAGAGCCCCCCGCCCCGCAGAGCTTCCCGCTCGCCGGATCCCCCCGCGCCTCTCATACCCTGCGAGTTCTGTACCAGATTGTTGCCCGTGCATCTCGTATGCCACCACCAGGTGACTGA

Protein sequence:

>DPOGS206835-PA
MEEAENKVCHNCKREIPLANFTIHAVHCARNIRLCPVCKEPVPVQDLQQHHDDQHKLVPCKQCGEDVCGTDMEDHVRDSCALTMQTCRFCTLELRRRELPAHERYCGARTELCECGEWVMMKYRQLHIDSNHGFIRLDDDPVPIQSIKASVSKQKVTTDKTPLSNGLNNLNRSAFPRKNNSRDLNTAADQTASTSSQKPTAMASTSKGNSSKTKEENITNANGAIPRAKRIERQHPGNDSKDKVKNSSSRGSMKKRPAPPPPASSAPVLQAALQRQQREETQRQLQNQENLARGLPPILNPAEKLERLRKMDALHQREPDDQSWKNRLQGRVWSRPHVYPIGHVLSGQTDVAGEVRKELKNLKPMTPEEFNDRYRNIQSERQDRFKEIKTSLRELRRGLNEVIAPYNSNSNADTNHSRHEEEAPCEFCGTSVLLEDLVLHQTGCRPDLAQYRSPPPSPGPRPPAEPPAPQSFPLAGSPRASHTLRVLYQIVARASRMPPPGD-
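
The transcript and protein records above are consistent in length: 1509 bp is 503 codons, that is 502 amino acids plus the stop codon, which this page writes as a trailing "-". A minimal sketch of that check follows, assuming the standard genetic code (NCBI translation table 1). The function name translate and its stop_symbol parameter are illustrative and are not part of the database or its tools.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon (hypothetical helper).

    Stop codons are written as stop_symbol to match the trailing "-" used
    in the protein record; any trailing partial codon is ignored.
    """
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    residues = []
    for i in range(0, usable, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First 30 nt and last 12 nt of DPOGS206835-TA, copied from the record above.
print(translate("ATGGAAGAAGCGGAGAATAAGGTTTGCCAT"))  # MEEAENKVCH
print(translate("CCAGGTGACTGA"))                    # PGD-

# Length bookkeeping: 502 residues plus one stop codon, three bases each.
print(3 * (502 + 1))                                # 1509
```

Passing the full nucleotide sequence to the same function should reproduce the DPOGS206835-PA record, including the final "-".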