Monarch geneset OGS2.0

DPOGS208836
Transcript         DPOGS208836-TA   3435 bp
Protein            DPOGS208836-PA   1144 aa
Genomic position   DPSCF300036 + 776626-784405
RNAseq coverage    921x (Rank: top 14%)
Annotation
Heliconius         HMEL015437          0.0      61.42%
Bombyx             BGIBMGA007943-TA    0.0      54.05%
Drosophila         (no hit)
EBI UniRef50       UniRef50_F4WTP3     6e-97    35.68%   Protein SMG7 n=5 Tax=Coelomata RepID=F4WTP3_ACREC
NCBI RefSeq        XP_001605792.1      2e-106   36.51%   PREDICTED: hypothetical protein [Nasonia vitripennis]
NCBI nr blastp     gi|156543150        4e-105   36.51%   PREDICTED: hypothetical protein LOC100122190 [Nasonia vitripennis]
NCBI nr blastx     gi|328793075        4e-107   29.69%   PREDICTED: hypothetical protein LOC409556 [Apis mellifera]
Group
KEGG pathway 
InterPro domain    [59-176]   IPR019458   1.4e-13   Telomerase activating protein Est1
Orthology group    MCL17822   Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208836-TA
ATGGTTTTAAATGCCGCTGTGCAGTTATTAAGAGAGGCGGAGGAATTGAAACAGAAGATTTTAAAATTCAATAGTTGCATTTCTATGCTTCAGGATAGAAGTTTATGGGTAACACAGCAACAGTTACAGAAGGTGTACCAGAAAGTTTTGGTACTGGATCTTGATTATGCTCTAGAGAAGAAAGTGGAACAAGATCTATGGAATGTTGGGTTCAAGCAGCAGATTGAGGCTTTGCAGGCCATTTCCAAAGATAGAAAGAGTGTCCTCAGAAGTGAAGCTCAAGGTATGCTGTCATGGGTGCTGCAGGCTGCCGCTGGGTTCTACCTATGCCTCTTGCATCAAATCTGCACAACATTTAAACTAGATCTACCATTTAGACGTAGGGCGTCCCTTCTTGGCTCGGTTGAAGGGTGGGAGGCCGGTGGGTGTCCGGAACCCGTTCGAGCCGGTGCTGGAGCTGCCCGGTATGCGTGCCAGCACTGTCTCGTACACCTAGGGGACCTCGCCCGTTACAGACACCAGCTGAAAGTCGCACACACCTTTTACAGGCATGCCCTAGCGGTGTCTGTGCATTCAGGGCAGCCATACAATCAGTTGGCGCTGGTCGCTTGGCGTCGTGGCCGCCGTCTGGCCGCCCTCTACTGGCACGTCCGGTCGCTGCTGGTCCGAGCGCCCTTCCCTCCCGCCCCCGCGAACCTCACCCGGACCCTGGCGGCCGCGGGAGACACTGTGCAAAAATGTTTCAGTCGTGACGTCAAGGAGACGCCGCTGCCCGTGCTGCCGGGGCTGTCCGGGGTGGAGGGGCACGCCACCAGCGCACCTTCAAAGGCAACTGTAACTGAAAAACTCGACTCGCACTCCTATGTAAATGAACTAGTACGAGCACTCCACTACCTGCACAGCTTGGAACATCTCGACACGGCCGAAGAGCTGGTTGGGAAGCTGAACTCGTCCCTGACACACCTCGTGGCCACTGACAGCTTCGATTCCATGACTCTGGTTAAGATGGCGTGCGTCACAATCTGGCTGGTTCACTCCAGTACGGAGGACCTTTCGGTGGAGCCGTCGTCCATGAGCGAGTCGGAAGGTCGGGCCGCAGTGCTGGCGTGCTCGCTGGCCGCTCACAGCGTGCTGGCACTGCTGCTCGCAGCACACACTGGGGACACGCCCAACAAGGGCTTGCCGGCATTGCGTGTGTGGCTTCAGTGGTCGTGGTGTCGGCCCGCGGCGCTCCGGTCGCATGCTTGGGGCTCCAGACCTCACATGTGGGCAGCGCTCGCACACGCACTCAACAACATGGGAGACGCCCTCGAAGACCCCGCCTATGAGACCCTCCCTCTGCCGGAGGATGAAGAGTTACACGGCTTCTTACCGCTGGAGGAGGCTTTGAAGGGACTCAAGTTTCCAAACCACTGCGGCTGGGACTCCAACAAACTGCCTCAAGAGGAACCCGAAGAAGACACGGCGTCCAGTGTATCAGCGTCGTGGGGGTCGTCGTACCTGGCGCTGGTCAGCGACACCGAGCTGCAGGCCCGCGTGAGGACGGCGCGGCTAAGACGACTCGGGGAGAAACTGGCGGAGCAGCACCCGGGACTACTCACCTGCGATACTGACGAAGACGGGGTGATGACATTTTCCACGAGCGAGTCTAGTAAGGAGCAGCTGTCCTTGGTGTTGGCGACCCTGACCCCGCCCTCCGCCCCGCCCACAGAGCCTAAGACCCCACCTCCGGCACCCCCACCACCACCACTCATCATATCGGAGGCTGACTTTCGAGAGAAAGTACGAGAAAAACGCGCTGGCATTCTCAAGCCGCAGGGGTCGCTGGAGCGCGCGAGGGAGGAGAGACGAGCCGCGCCCGCCGCTGAGGATCAGGACGGCGAGGAGTGTGAGGAGGGAAGTAAGAACGAGGACAAGAAGGAGGCTCGCAAACCACGAGTCAACATCGCCATGGCGGCCATCATGAGGAAACAGGAGGAGAGCAACAAACAGGTTAAATT
TGTAACTCCACCCCCCACGCCGGAGACCACAGATGAAGCGAGCGAGAGTTCGTCGAAGGATGAGAAAACCAAAGTCATTCAACCGAAGGCCATTAAATCATTAGCAAATTTACCGGTGGGAAGAAAAACGGGGGGAATTCTCTCGTTGAAAGATAAGTCGGCCGGATATCCGCACCTCCAGAATACGGAAACGGAAACGAAGAAACCGGAACAGGAGGAGATGAACGAAGAGAAGTCTGCCCAAAACAGTTCCGTCTCGCAGAGCTACCATCAACGCGATCAAGGTACCAACTGGCCGACGATGCCGGCGCCCTACGGTGACAATAATAAAATGAACTTCCAAAAGAATTACGGAATACAAAACAGCGGCATAAGTTACAACCCCAACTACCAACCTCCCCCCAACACTCAGGGGATACGACTACCTGTTGTCAACCCCAAGGAGATCGACGTCAGGACGGCGGCGCTTCAGAAACAGAACTCTCGCCAGGAAATATTCCAGGAGGCCAACAAATTCAATCACGGATACCAAATATCGGGGGACAAAAAGAATTTCCTCAACGACCTGCCGCCGAGATTCGCGAATCAGTACCGCTACTGGCAGAGTCCGCAGGAAAACCAGTTCAACGACAACAAGTTCAGGGACGACAGCAACAAACTCACCGCGCCCTTCACGGCACAACCTCCGAGACAAAATTGGCCGAACCAGAGTGAAAACTTCCAACAGGGGATTCCCTGGTGGAAACCTGATACCCGCACCAATTTCAATCAGCCTAACTTCTCCACCACACCCATGAATGTACCAAACTTCTATTCTCAAATGTCCGGGAACGTGCCTAATATTTATCCAAATTTACAATACAGCCAGATGCAAGGACAGAATATTGGCCAAAACATGCCAACCGTGGGCCAGAATTTGGCAAATATAGGACGAAATAAACAGGATAGTTTGGCGCCGTCGTTCGGTCAGTCGGCCGTCGGTCAGCCGCAGCTCCAGACCCTGGCGAGCATGGTGTCGTCTCCCGGCTACGGCTCGGCCTTGAACAGCTTCACCCCCTACCCGGCGGCCGTCAGCTACGACTCTTCCTTGTATCCTCAGTTCAACAAACTCGGCTACCAGCCCCTGCAGCTGAACAAACAGAACTTCCAAGGGAAGGAGTCGGAGCCCGGAGTCAGCTTCGGCAGCAACGTGCTGGACGTACAGCATATGAATTATAACGAACCGTTCGTCGCCGACGGAGCCAACGACGCCTCGGAAGACGCGGCGGGCGCTCAGTCGGAGGCCGGCGTCTCCAACACATACTCGCTGTTCCGACAAGACGCGCACGCCTGGCCGCCCTCCACACATCAGTCGCTGTGGTCGGGGCCGGGCGGGTCTCCGCTGGAGCGTCTCCTCGAGCAACAGAAGCAGATGAAGCCGCCGTCGACGCACTGA

Protein sequence:

>DPOGS208836-PA
MVLNAAVQLLREAEELKQKILKFNSCISMLQDRSLWVTQQQLQKVYQKVLVLDLDYALEKKVEQDLWNVGFKQQIEALQAISKDRKSVLRSEAQGMLSWVLQAAAGFYLCLLHQICTTFKLDLPFRRRASLLGSVEGWEAGGCPEPVRAGAGAARYACQHCLVHLGDLARYRHQLKVAHTFYRHALAVSVHSGQPYNQLALVAWRRGRRLAALYWHVRSLLVRAPFPPAPANLTRTLAAAGDTVQKCFSRDVKETPLPVLPGLSGVEGHATSAPSKATVTEKLDSHSYVNELVRALHYLHSLEHLDTAEELVGKLNSSLTHLVATDSFDSMTLVKMACVTIWLVHSSTEDLSVEPSSMSESEGRAAVLACSLAAHSVLALLLAAHTGDTPNKGLPALRVWLQWSWCRPAALRSHAWGSRPHMWAALAHALNNMGDALEDPAYETLPLPEDEELHGFLPLEEALKGLKFPNHCGWDSNKLPQEEPEEDTASSVSASWGSSYLALVSDTELQARVRTARLRRLGEKLAEQHPGLLTCDTDEDGVMTFSTSESSKEQLSLVLATLTPPSAPPTEPKTPPPAPPPPPLIISEADFREKVREKRAGILKPQGSLERAREERRAAPAAEDQDGEECEEGSKNEDKKEARKPRVNIAMAAIMRKQEESNKQVKFVTPPPTPETTDEASESSSKDEKTKVIQPKAIKSLANLPVGRKTGGILSLKDKSAGYPHLQNTETETKKPEQEEMNEEKSAQNSSVSQSYHQRDQGTNWPTMPAPYGDNNKMNFQKNYGIQNSGISYNPNYQPPPNTQGIRLPVVNPKEIDVRTAALQKQNSRQEIFQEANKFNHGYQISGDKKNFLNDLPPRFANQYRYWQSPQENQFNDNKFRDDSNKLTAPFTAQPPRQNWPNQSENFQQGIPWWKPDTRTNFNQPNFSTTPMNVPNFYSQMSGNVPNIYPNLQYSQMQGQNIGQNMPTVGQNLANIGRNKQDSLAPSFGQSAVGQPQLQTLASMVSSPGYGSALNSFTPYPAAVSYDSSLYPQFNKLGYQPLQLNKQNFQGKESEPGVSFGSNVLDVQHMNYNEPFVADGANDASEDAAGAQSEAGVSNTYSLFRQDAHAWPPSTHQSLWSGPGGSPLERLLEQQKQMKPPSTH-
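
The lengths listed for this record agree with each other: 3 x (1144 + 1) = 3435, that is, 1144 codons plus one stop codon, assuming the transcript record is the coding sequence only and the trailing "-" in the protein record marks the stop. A minimal sketch of that check for FASTA records like the two above follows. The helper names `read_fasta` and `lengths_consistent` are hypothetical and are not part of the Monarch geneset tools, and the toy records stand in for the real sequences.

```python
def read_fasta(text):
    """Parse FASTA text into {record_id: sequence}, joining wrapped lines."""
    records = {}
    current = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # The record ID is the first token after '>'.
            current = line[1:].split()[0]
            records[current] = []
        elif current is not None:
            records[current].append(line)
    return {name: "".join(parts) for name, parts in records.items()}


def lengths_consistent(cds, protein):
    """Return True if the CDS length equals 3 * (residues + 1 stop codon)."""
    # Assumption: a trailing '-' or '*' in the protein marks the stop codon.
    residues = protein.rstrip("-*")
    return len(cds) == 3 * (len(residues) + 1)


# Toy records (hypothetical, not the monarch sequences).
toy = """>toy-TA
ATGGTT
TTATGA
>toy-PA
MVL-
"""

seqs = read_fasta(toy)
print(lengths_consistent(seqs["toy-TA"], seqs["toy-PA"]))
```

The same two functions could be applied to the DPOGS208836-TA and DPOGS208836-PA records once they are saved to a file and read in as text.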