Monarch geneset OGS2.0

DPOGS200761
TranscriptDPOGS200761-TA969 bp
ProteinDPOGS200761-PA322 aa
Genomic positionDPSCF300556 + 28436-33587
RNAseq coverage9x (Rank: top 85%)
Annotation
HeliconiusHMEL0055431e-10768.42% 
BombyxBGIBMGA004227-TA9e-13769.72% 
DrosophilaFen1-PA3e-12257.66% 
EBI UniRef50UniRef50_Q7K7A94e-12057.66%Flap endonuclease 1 n=38 Tax=Eukaryota RepID=FEN1_DROME
NCBI RefSeqXP_001651504.15e-13062.89%flap endonuclease-1 [Aedes aegypti]
NCBI nr blastpgi|3202029354e-13369.44%flap endonuclease-1 [Bombyx mori]
NCBI nr blastxgi|3202029359e-14169.29%flap endonuclease-1 [Bombyx mori]
Group
Gene OntologyGO:00062811.1e-158DNA repair
GO:00045181.1e-158nuclease activity
GO:00036775.4e-40DNA binding
GO:00038245.4e-40catalytic activity
KEGG pathwayaag:AaeL_AAEL0058701e-129 
 K04799 (FEN1, RAD2)maps-> Base excision repair
    DNA replication
    Non-homologous end-joining
InterPro domain[1-319] IPR0060841.1e-158DNA repair protein (XPGC)/yeast Rad
[1-107] IPR0060854.9e-50XPG N-terminal
[162-301] IPR0200455.4e-405'-3' exonuclease, C-terminal subdomain
[164-197] IPR0089181.6e-12Helix-hairpin-helix motif, class 2
Orthology groupMCL12447 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200761-TA
ATGGGTATTTTAGGATTATCAAAGTTGATTGCAGATATTGCTCCAATGGCTGTAAAAGAAACAGAGATAAAAAATTATTTCGGTAGGAAAGTTGCCATCGACGCATCTATGAGCTTGTATCAATTCTTAATTGCTGTAAGAAGTCAAGGCGCTCAGCTGACGTCCGTTGATGGTGAAACAACATCACACCTAATGGGTACATTCTACAGAACGATTCGTCTCATAGAAGATGGTATCAAGCCTGTGTACGTTTTTGATGGTAAACCGCCTGATATGAAGTCACATCAATTGAACAAGAGGGCCGAGAGACGAGAGGAAGCTGAGAAAGAACTCCAAAAGGCTACCGAGGCTGGTGACACGGCATCTATAGACAAGTTCAACCGTCGGTTGGTGAAGGTGACTCAGCAACACGGCGCCGAAGCTCGGCAGTTGTTGAAGCTTATGGGCATACCCGTGGTGGAGGCTCCGTGTGAAGCTGAGGCACAATGCGCTGAATTATTCATTGACCTCTGCATTCTGTTGGGTTGTGATTACTGCGGATCCATCAAAGGGATCGGACCGAAACGGGCCATCGAACTCATCAAGCAACACCGCAGTATAGAACAGGTCCTTCACAATATCGACACAAAGAAGTACAGTCCGCCGGAGAATTGGGAATATGAAAACGCTCGGAGACTGTTCCAGCAACCAGAAGTTACCGAGGCGAAGGATGTCGAGTTAAAATGGTCGGATCCTGACGAGGAAGGTCTGGTGAAGTTCCTCTGTGGAGACAAACAGTTCAACGAGGAGCGCGTCAGGAACGGGGCCAAGAAACTCATGAAGGCGCGCACCGGAACCACGCAGGGCAGGCTGGATGGATTCTTCAAGGTGTTGTCAACAACACCAAACCCAAAAAGGAAAGCGGAGGAAGATAAAAAGAGTGCCAACAAGAAAGTTAAAACAGCTGGAAGGGGGCGGAAACCGAAATAA

Protein sequence:

>DPOGS200761-PA
MGILGLSKLIADIAPMAVKETEIKNYFGRKVAIDASMSLYQFLIAVRSQGAQLTSVDGETTSHLMGTFYRTIRLIEDGIKPVYVFDGKPPDMKSHQLNKRAERREEAEKELQKATEAGDTASIDKFNRRLVKVTQQHGAEARQLLKLMGIPVVEAPCEAEAQCAELFIDLCILLGCDYCGSIKGIGPKRAIELIKQHRSIEQVLHNIDTKKYSPPENWEYENARRLFQQPEVTEAKDVELKWSDPDEEGLVKFLCGDKQFNEERVRNGAKKLMKARTGTTQGRLDGFFKVLSTTPNPKRKAEEDKKSANKKVKTAGRGRKPK-