Monarch geneset OGS2.0

DPOGS206937
Transcript: DPOGS206937-TA, 2319 bp
Protein: DPOGS206937-PA, 772 aa
Genomic position: DPSCF300001 - 848586-853282
RNAseq coverage: 115x (Rank: top 58%)
Annotation
Heliconius: HMEL015459, 2e-47, 34.19%
Bombyx: BGIBMGA012879-TA, 9e-44, 84.04%
Drosophila: Snm1-PA, 1e-68, 43.24%
EBI UniRef50: UniRef50_B0XKM0, 7e-76, 47.85%, DNA cross-link repair 1A protein n=1 Tax=Culex quinquefasciatus RepID=B0XKM0_CULQU
NCBI RefSeq: XP_001635152.1, 5e-81, 46.06%, predicted protein [Nematostella vectensis]
NCBI nr blastp: gi|156390186, 9e-80, 46.06%, predicted protein [Nematostella vectensis]
NCBI nr blastx: gi|156390186, 6e-78, 46.18%, predicted protein [Nematostella vectensis]
Group
KEGG pathway: gga:430764, 6e-19
    K10887 (DCLRE1C, ARTEMIS, SCIDA) maps-> Non-homologous end-joining
    Primary immunodeficiency
InterPro domain: [649-738] IPR011084, 5e-26, DNA repair metallo-beta-lactamase
Orthology group: MCL30965, Lepidoptera specific

Nucleotide sequence:

>DPOGS206937-TA
ATGAATGATGATATAAATGATTATCTGCCATCTTTATTGAATCCAAGTGGCATTAAGAGAAACAGTGCTTTATCACAAACTCAGGTTTCTTTGTCGCTAAGAACAGGTAAAATTAAATTAAATCAACCACTAGGTGAGGTTCCCCAATTGGAAGAGACAGCTACAACCGAACCAGCTGCCAGCACCATGGAAGGTGTATCAAGTATAACATGTAATAATAATGGCCAATTAGGAATTACTCCTATCGCTGAACTAAATAATGATACACCAGTTTCACTCAATTTTAACAATATAGACATAGAAGTTCCTAAAACTAACACCGACGACAATGAGAATGACAATAATCAAAATTCTCTGATACTAGTACTGTTACAAGAACCCATTATGTGTAGTGATACAACTCTTCTGTATGAAGACAATAAAATTCCCATCGCTTCAACGTCATCAGTTGGTAATTTTAACAATACAATTATCAATTCAAATAATAATATGAAAATTGCTGATGATTGCAAAGAGGAAAATGAAGCAATGGATAATGTTGATTGTGTTATCGCTGAAAGCTGTCAAAAATCTGATGAGTCAACATTAGATGTGAACAATTTGAGTCCTAAACTCAAGCCATCAAAGAGAAAATCTGTCTCTGTCAACTTAGAAATTAAAAAACAAAAGTTGACACATACAGATGATGGAGAAGAGAAATTAAAAGTTCTTATTTTAAACGCAAACGCCATTGAGAAACTATCCCCTGTCATTATACAACCACAAAATTATAAATTAGTGAAACGAAAAATTCGTGATAGAAGTATATTGAAAAACAAAGTTAACAAACCGGTGTGTGATACATATAATGGGAAAAGTGTTAGTGGCAATTTAAAACAGAAACCAATAGATTGTTATTTCACTAGTGCTTACCATCTGAGGTCTTATACAAAGAGATTAAAGGATAATGTTGACTCAGTTAATCACGCACTAGCCGTGGTGCGTGATGAGGCTCAGGAAATGAGCAAAAATTTGGGTATAGCCGACCTCAAATGCGATATAGGCCGGTCGTTAAAGGAGCGCAGAAATGACAAATTACCGAAACTATTGCCGTATTCAGCACCAGCTAAACGAGATATCAAGGCGTCCATGTCGGAGGCGCCCAGCGGTTTGTCGCCGAGAAATAGACAGGATAGCCTAGGCTCCCATTCGCCGAAAAAAAACGCTGACTCCATTCAGCTCGGTTTGGGATTGAAGACGAAGTTAAATAAAGCAGGCGTCAACCGAAACATTCCACATTATAAAATTGTCGCAGCACCTATACCAGGGACGCATTTTGCCGTCGATGCATTCTCATACGGCGATATACCCAACGTGAAACATTACTTCCTGACGCATTTCCATTCCGACCATTATTCGGGTTTAAAGAAGAATTTCAACAAATTGATTTTCTCGGATTTATGCATTTCGCGTTTGGGCGTTAATTTGAAATGCTTCCACGTTATAAACGTCGATGAAACTATAAAAATTGAGGGTGTCGAGGTCACAGCCGTTGACGCTAATCACTGCCCTGGCGCTTTGATGTTGGTATTCACTTTGCCCAATGGTAAGACGCTTCTGCATACTGGAGATTTTAGGGCGTGTCCTCCTATGGAGTCATATCCTGTTTTTTGGAACAAAGATATACACACAATATATCTTGATACGACCTATTGCAATCCTCGCTATGACTTTCCAACGCAAGATCAAAGCTTGGAGATGGCTCTGTACATTTTGAGGCAGAAGAAAATTACTCTAGAAAAGGCCGGGAAGCAGTTTTCATCTGTACTCATAGTGTGTGGAACTTACACCATTGGTAAGGAGAAGTTCTTCCTTGGTCTGGCTCGTCGCGTGGGATGTTCAGTGTGGGCGTGTCCGGAGAAGGACCGCGTGCTGCAGGCGGTGGAGGGTCGCAGCTTCAATCACTCGCAACCAGCCAGCTGTCAGCTGCATGTTGTGCCCATGAGGGATCTGGTGCATGA
GAAATTACAGACATATCTGGAAAGTCTGAAAGGATCGTTCAGCGAAGTTGTTGCTTTCAAACCCAGTGGCTGGGAGAATGGTAGAAATTCATCAGTGCAAAAGGACTCTGTTACAATACATGGTATACCATACAGTGAACATTCTAGTTTTTCAGAAATGATTAGATTTGTCAAATTCCTAAAGCCGAAACAAGTTGTGCCCATAGTTGATATTTCCGGTGGGATTAAAACTGTACAGAAGTTTTTTCCTTGTCCTTTGGTTAATAGAGATGATCTGCAGTGTCAAAGTCGGGTCACAGACTACTTCACACACGGCTGA

Protein sequence:

>DPOGS206937-PA
MNDDINDYLPSLLNPSGIKRNSALSQTQVSLSLRTGKIKLNQPLGEVPQLEETATTEPAASTMEGVSSITCNNNGQLGITPIAELNNDTPVSLNFNNIDIEVPKTNTDDNENDNNQNSLILVLLQEPIMCSDTTLLYEDNKIPIASTSSVGNFNNTIINSNNNMKIADDCKEENEAMDNVDCVIAESCQKSDESTLDVNNLSPKLKPSKRKSVSVNLEIKKQKLTHTDDGEEKLKVLILNANAIEKLSPVIIQPQNYKLVKRKIRDRSILKNKVNKPVCDTYNGKSVSGNLKQKPIDCYFTSAYHLRSYTKRLKDNVDSVNHALAVVRDEAQEMSKNLGIADLKCDIGRSLKERRNDKLPKLLPYSAPAKRDIKASMSEAPSGLSPRNRQDSLGSHSPKKNADSIQLGLGLKTKLNKAGVNRNIPHYKIVAAPIPGTHFAVDAFSYGDIPNVKHYFLTHFHSDHYSGLKKNFNKLIFSDLCISRLGVNLKCFHVINVDETIKIEGVEVTAVDANHCPGALMLVFTLPNGKTLLHTGDFRACPPMESYPVFWNKDIHTIYLDTTYCNPRYDFPTQDQSLEMALYILRQKKITLEKAGKQFSSVLIVCGTYTIGKEKFFLGLARRVGCSVWACPEKDRVLQAVEGRSFNHSQPASCQLHVVPMRDLVHEKLQTYLESLKGSFSEVVAFKPSGWENGRNSSVQKDSVTIHGIPYSEHSSFSEMIRFVKFLKPKQVVPIVDISGGIKTVQKFFPCPLVNRDDLQCQSRVTDYFTHG-