Monarch geneset OGS2.0

DPOGS200457
Transcript: DPOGS200457-TA (5463 bp)
Protein: DPOGS200457-PA (1820 aa)
Genomic position: DPSCF300260 - 95268-114410
RNAseq coverage: 115x (Rank: top 58%)
Annotation
Heliconius: HMEL010787; 9e-175; 34.91%
Bombyx: BGIBMGA005445-TA; 0.0; 68.62%
Drosophila: egg-PA; 3e-46; 27.88%
EBI UniRef50: UniRef50_B0WQS5; 5e-180; 37.67%; Tetratricopeptide repeat protein, tpr n=10 Tax=cellular organisms RepID=B0WQS5_CULQU
NCBI RefSeq: XP_001950304.1; 0.0; 50.47%; PREDICTED: similar to conserved hypothetical protein [Acyrthosiphon pisum]
NCBI nr blastp: gi|170056578; 0.0; 39.89%; conserved hypothetical protein [Culex quinquefasciatus]
NCBI nr blastx: gi|170056578; 0.0; 37.02%; conserved hypothetical protein [Culex quinquefasciatus]
Group
KEGG pathway: nvi:100120014; 2e-69
 K11421 (SETDB) maps to Lysine degradation
Orthology group: MCL16717; Insect specific

Nucleotide sequence:

>DPOGS200457-TA
ATGGACAGCAATATAAATAGCGGGTTAAATAAGCCGGTAAGTGAGGAAAAAACTAAGGATAGTGAGAAAATAGCTGATTTTGAATTAATTTTACTACGTAGAAGACAGAATGCCGCGAGAGTTCGTGCCTGTAGGGAAAGGAAAAAAGCTCTTGGCCTCCAGTCATCTTTACCAGTTGATATTAAAACTGAGCCGCCAGATGACATCAGTGGGCTAACAGATTCAATGCCAAGTACAAGCTGGAGTCCAGCGTCAAATCTTGGATTTAATGCAATTTCAAGTGAGAAGCCCGCAGTTGATGATATTGATGCTGAAAAGGATCTGTATCAAAAGCAGCTAAATGCAGAAAGATGTCGTAGGTATCGTCAAAAATGTAAACTGAAATCTGGCGTCCGTACTAAGCGAACAACAGGGGAACTGGATACTAGTTCCAGTCGAGACTTAGTTGCTGGGGGAGATGGATTTGAAAGCTCTACCAGTCAGACACAAGGTTCTTCCAACGACGAATCTACATCATCAGGTGCTAAAAGTACAACCGCCGCTACAAACAACGCTAATTCCAGCTCTGCTGCGGGGTTGTACTGTCGTCGGTACAGGGAGAAACTGAATGCACGAAGAAAACGAGTAAAAGAAGATCCCTCATCATTCTATACATTGTATGTCAAACATAACGGAGCTCATAACTTATTCGAAAACTTATTCGATAATAATCCATTTGGTTTTTCGTGTACTGTTTGTGATAGACTATGGTTTGAAAATGAATTGAGAAGTCCTCCTTCGTCTTGCGGTGAAATTTTAAGACAGATATGCCCAAATGTCCCCTGTCAAGATATTGTAGTTTGTGCCGCCTGTAAGGTCTCATTAGTTGCTGGAAAAATTCCGAATTTGGCGGTGTACAATGGTTTTAAGTATCCGCCTAAGCCGAATCTCCCTCAAATGGATATGGTTTCAGAGCGATTAATATCACCAAGATTACCCTTTATGCAAATTAGACGTTTGCGATACGTTGAGGGACAACATAGTGTCACTGGTCAAGTTATCAATGTGCCCATTCATGTTGACAACTTGGTCCAAACTCTGCCTAGAAATATGGTCGACGATTTCTGTATAAATGTTCACGTAAAAAACAAATCACTGCACAAATCTAGTTATTTGCAAGGATTATTAAAAAAGCGGGTTATTAGAGACTGGCTTGATTATCTTATAGACACCCCGCTTTATAAGCATTATAATATAAAAATCAATCCGTATTTCTTAGAGGATTTGAACAACGAGAGTGAAATGCCAGACATTGATTTAAAGGATATCGCTGAACCAATTGTAATTGGTGATAGCCTAGTTGCTGAACAGCATACACTTTTATGGTCCACGGAAAGAGATCTACAAATAGCACCTGGAGAGAATAAAAGGTCTTTGAGTTTACTGTTTGATGCTTATGCGGAAGAATTATCTTTCCCAACTATTTATTACGGTCAATTTCGGAAATTCAAAGACGGTGTTAATTGTAAAGCACACTCAATTGCAACAAGTGAAATTCGGCGGACCGATCGACGAGGGGCCATTCCTCGCCACTTGCTCTTCTTAGTCATGAAGGTTATTTTATTTAGACTGAGTGAAAACATCGGTATTGCTTCTAAATATATCGTGGAAGACACAAAAGTAACGAAAGAGCAAATATTATCTTCCGACTATCTTAACGACTGTCGGGAACCAAATTTGTCTTTTCTCAAATTCATACCAAATTCAGTACAATATTGGCAAAATCGAAAAAAAGATCTCTTTGCAATGATAAGACAACTTGGGACCCCAACAGTTTTTTTGTCCCTGAGCGCTAATGAAATATCATGGAAATGGCTTTTAAAAACTTTGCATAAACTGAAACACGGCACGGAGATCTCTGATTTAGAAATTGATCAGATGCATTATAAAGTCAAGGCAGAATTAATAAATGAAGATGCTGTGACTTGCGCCATTTATTTTAATAAACTCGTCAATGTTAT
AATGACAATTCTTCAAAATAAAACCGTTAGCCCATTCGGTAAACATTATGTACGACATTATTTTAGGAGGATTGAATTTCAACACAGAGGAAATACTCATGCACATATCTTGCTGTGGTTAAATCAAGCGCCTAACGATGCTTTTGGAGGTGACATGACTTCTGCTATTAAACTTATCGATAACTTAATTTCAGTATCAAAGACAGAATGTTCAGGGCACATTGAATTGGTCACACACCATCACACATATTCGTGTTATAAAAATAATCAAAACCAACTAAAGTGCAGATTCAATGCTCCATATATGCCTAGTAGAACTACCGTTTTGCTTGAACCTATGGCTAAATCGTCTGATGAGGAAAAACGAATTTATAACGAGTATAAAAAGAGATATCACATCATACACCAAAAACTGGAATGTCATGACTATTATAATATTGATGATTTTTATCGTAAAAATGGCATAAAGTCAGATGTAGAGTATTATAAAATTCTGTCCGCTGGAATTCTACGACCGATGGTTTTCGTTAAACGTCACCCTAATGAAAAATGGCACAACTTTTTCAATCCCTTCATATTTCACCATTTACAATCGAGCATGGATATCCAATACATCACAGACGAGTATTCTTGTGCTGCTTATATTGCGGAGTGTGTAAACAAATCCGATCGTAGCGTCAGTAATCTTCAACGGGAGCTGTTGGATCTTTTGGAGAAAAATCCAAATCTAGATTTGGTTGACATGACCAAACATATGAGTGTTAATATTTTGAATGCAATGGAAATGTCCAGCCAAGAAGCAGCATGGTTCCTTCTTAGGGAACCATTGTGTAAGTCGACTCTTAAAGTAGAATTCATACCCACAATGTGGCCTCAGGAACGTCATCGTTTTAGAAAAACTGAAAAAGAATTAGACCGTCGTCCCGATGAAGACACAAGTGTTTGGAAAGAAAACTATTTTGAGAATTACGAAAATCGACCAGCAGAGTTGGAAGATGTCTCACTGATTCAGTTTGTTGCTTGGTATAAGACCAGAACTAGAAAAAAAATGTCAGGACCTCAAATTGCGAACCAAAACTGGGATTCGGAAGACGAAGAAGACGTTGAAGAAGATATAACACCAGAAGAAAATATGAATCAGCAAAGTGAAAAAGTATTTTACCGTCGTAAAACTCCAAGAGTCGTAAGCTATTTGCGTTACGATATGACTGATCATGAGCTAGACTACAAACGAGAAATGGTTACCTTGTTTATTCCATTTCGTAACGAAGAAAGGGACATTTTAGCTGATATGAGTTTCAATCAAATCTATGAAGAAAATGAAGAACTCATTTTAGTTCGCCGAGAAGACTTCGAAGGAAATTTGGACATTGATAAAATTTTTGCGGCATACAGAATATTATGTCACCCTGATCAGAATGGCAGAGACCTAGAAATTTTTCCGCTTATGCATCACGATGCGGATCCATTCAGGGAATTTTCCAACAATCCTATTCCAGAGCAGGAAGTTGTTATATTAAATGCTGAAGATGAGTCAGAAGAAGTCATTGATAGTAGAAAATTAGATGTAGCTGAGAGCTATGTTGATTTACGAGACAGTTCCGAGGAAGACACTCAAGACACGAAGAAAACTGATCATGAAGTAGACAATCTTAATACAACGGAAGATATTGATGAAGATGAGCCAGCGAAGAAAGAAGACACGGTATTGCCCATGGTGAGGTGCGTCAACAAGATGTGCGCTCGCACTTCCTTCGACTTCTACACAGCTGAGCGCAGCACAGTGGACTTCTATGATCCAGAGAGAAAGAAAAGAGGTTATGTTTGCAGAACCTGTCTCAGTCTGGTCGAAGAAAGGAATCAGCTGTTGATCAGCGCCTTTAAATCCCAGACGCCCCTCCTACAATTGGAGACTCGTCAGCAGGAAGAGGATCTGGTAGAGATATCAGAATCGCAGTCCGAAGATAATCTAATACCGGAAGTTGATGATGACGTCATAG
GCGAGGAGGGGGCTAGGTTTATAGAGGAGAAGTTGACTGATGTCCTGAACGAGACCTGGGTCAAGTACAACATGGATGACCGGCTCCAGGAGGCACAGGATCAGCTAAAACAACAGCTGGAACAACTGCAAAAGCACAGTTTGGAGATCGACCAGTTACTGGACGAGTGCCAGATGTCCACAGATAAGCTCCGCACAGAGATTTACTCTACGTTCAAGCCAGACATCAGGAAACTTCCGTCGCTACTTATATACGACGTGCCAGATTGCTCTTACACCTTCGTCGACCTCGCTGAACAGGGAAGTAGACTCTTAAATCTTCGGAAATCATCTTTATCTGAGTCTCCAACAAAAAAATCCACAACAGATCAGGATTCTGATGAATCAGTGGTACATATATCTGTGGAATCCGCTCCCTCCCACCTGCCTCCCGTGGGGGAGTTGTCTTATCCTCAGTTGGAGGTGGGAATGATAGTCTACGCATCTAAGAATGCTTTGGGGACCTGGATGAAGGGCAAGATATTGGAGATAACACCAAAGTCAGAAGATATTGAACGGATCTACTTGTGGGATCTTCAGACCAGAGTTGCTATTCTAATTTACTCTATCAACATTGGTCATCCAGCCAATCAAGCTACTCTCATTGATCATTCAGTCAATCATCCAATCCATTTAAGCAGAGAACTGCCACACTGTACACGTGTGATAGCCTTGTTCAAGGACATCATGAGGCGCGAGTTTTTTTACCCGGGTATCGTCGCAGAAATGCCCAACCCAAGGAATAGTTACCGCTACCTGATATTCTTCGACGATGGCTACTCTCAATACGCGCCGCACTCTAAGGTCCGTCTGGTGTGCGAGTGCGCGTCTCACGTGTGGGAGGAAGTACAGCCCAAGTCGCGAGAATTCGTCCGAAAATATCTCCTGGCTTACCCTGAGAGACCCATGGTGAGGTTGCACCCTGGACAGAGCTTGAAGACGGAATGGAAGGACAACTGGTGGTCATCCGTGGTGGTGTCGGTGGACGCGTCGCTGGTGGAAATCCAGTTCCTCCAGCTGGAGAGACGAGAGTGGATCTATCGAGGATCCACGAGACTCGCCCCCCTGTACCTGGAACTGCAGGCCGCGGAGAGACACAGGCCCAGAGCCCTGCCACGGACACAGACCACGAGGACGAACATGCCCTACGTGGAGTACACCAGATCTGAAGAACAGACGAGCAAACAAGCCAAGACTTCGCCACAGCAACAACAGAGCGAGGGATTTCCTCGTCAGCGAGCCGTTGCCAAGAAGACTACCACGAAGACTCGCCAACCACCCCGTACAGCCGTACAGAGCCTCGACCACTTTACTAGTAAACTAGTGAAAATTATTTTGGATTCTCATTCTCATTACTTGCTTGTAATTTTGCTGGTCTTGTTCCTGATGAGTGTCTGCCTTGCTTTATCGTCGAAGTTTTGTTGA

Protein sequence:

>DPOGS200457-PA
MDSNINSGLNKPVSEEKTKDSEKIADFELILLRRRQNAARVRACRERKKALGLQSSLPVDIKTEPPDDISGLTDSMPSTSWSPASNLGFNAISSEKPAVDDIDAEKDLYQKQLNAERCRRYRQKCKLKSGVRTKRTTGELDTSSSRDLVAGGDGFESSTSQTQGSSNDESTSSGAKSTTAATNNANSSSAAGLYCRRYREKLNARRKRVKEDPSSFYTLYVKHNGAHNLFENLFDNNPFGFSCTVCDRLWFENELRSPPSSCGEILRQICPNVPCQDIVVCAACKVSLVAGKIPNLAVYNGFKYPPKPNLPQMDMVSERLISPRLPFMQIRRLRYVEGQHSVTGQVINVPIHVDNLVQTLPRNMVDDFCINVHVKNKSLHKSSYLQGLLKKRVIRDWLDYLIDTPLYKHYNIKINPYFLEDLNNESEMPDIDLKDIAEPIVIGDSLVAEQHTLLWSTERDLQIAPGENKRSLSLLFDAYAEELSFPTIYYGQFRKFKDGVNCKAHSIATSEIRRTDRRGAIPRHLLFLVMKVILFRLSENIGIASKYIVEDTKVTKEQILSSDYLNDCREPNLSFLKFIPNSVQYWQNRKKDLFAMIRQLGTPTVFLSLSANEISWKWLLKTLHKLKHGTEISDLEIDQMHYKVKAELINEDAVTCAIYFNKLVNVIMTILQNKTVSPFGKHYVRHYFRRIEFQHRGNTHAHILLWLNQAPNDAFGGDMTSAIKLIDNLISVSKTECSGHIELVTHHHTYSCYKNNQNQLKCRFNAPYMPSRTTVLLEPMAKSSDEEKRIYNEYKKRYHIIHQKLECHDYYNIDDFYRKNGIKSDVEYYKILSAGILRPMVFVKRHPNEKWHNFFNPFIFHHLQSSMDIQYITDEYSCAAYIAECVNKSDRSVSNLQRELLDLLEKNPNLDLVDMTKHMSVNILNAMEMSSQEAAWFLLREPLCKSTLKVEFIPTMWPQERHRFRKTEKELDRRPDEDTSVWKENYFENYENRPAELEDVSLIQFVAWYKTRTRKKMSGPQIANQNWDSEDEEDVEEDITPEENMNQQSEKVFYRRKTPRVVSYLRYDMTDHELDYKREMVTLFIPFRNEERDILADMSFNQIYEENEELILVRREDFEGNLDIDKIFAAYRILCHPDQNGRDLEIFPLMHHDADPFREFSNNPIPEQEVVILNAEDESEEVIDSRKLDVAESYVDLRDSSEEDTQDTKKTDHEVDNLNTTEDIDEDEPAKKEDTVLPMVRCVNKMCARTSFDFYTAERSTVDFYDPERKKRGYVCRTCLSLVEERNQLLISAFKSQTPLLQLETRQQEEDLVEISESQSEDNLIPEVDDDVIGEEGARFIEEKLTDVLNETWVKYNMDDRLQEAQDQLKQQLEQLQKHSLEIDQLLDECQMSTDKLRTEIYSTFKPDIRKLPSLLIYDVPDCSYTFVDLAEQGSRLLNLRKSSLSESPTKKSTTDQDSDESVVHISVESAPSHLPPVGELSYPQLEVGMIVYASKNALGTWMKGKILEITPKSEDIERIYLWDLQTRVAILIYSINIGHPANQATLIDHSVNHPIHLSRELPHCTRVIALFKDIMRREFFYPGIVAEMPNPRNSYRYLIFFDDGYSQYAPHSKVRLVCECASHVWEEVQPKSREFVRKYLLAYPERPMVRLHPGQSLKTEWKDNWWSSVVVSVDASLVEIQFLQLERREWIYRGSTRLAPLYLELQAAERHRPRALPRTQTTRTNMPYVEYTRSEEQTSKQAKTSPQQQQSEGFPRQRAVAKKTTTKTRQPPRTAVQSLDHFTSKLVKIILDSHSHYLLVILLVLFLMSVCLALSSKFC-