Monarch geneset OGS2.0

DPOGS214287
TranscriptDPOGS214287-TA3546 bp
ProteinDPOGS214287-PA1181 aa
Genomic positionDPSCF300014 + 2133891-2141642
RNAseq coverage1557x (Rank: top 8%)
Annotation
HeliconiusHMEL0114190.082.12% 
BombyxBGIBMGA005999-TA0.082.81% 
DrosophilaCG6621-PA4e-10347.22% 
EBI UniRef50UniRef50_D6WJY60.058.20%Putative uncharacterized protein n=2 Tax=Neoptera RepID=D6WJY6_TRICA
NCBI RefSeqXP_974328.10.058.20%PREDICTED: similar to CG6621 CG6621-PA [Tribolium castaneum]
NCBI nr blastpgi|910823170.058.20%PREDICTED: similar to CG6621 CG6621-PA [Tribolium castaneum]
NCBI nr blastxgi|910823170.047.86%PREDICTED: similar to CG6621 CG6621-PA [Tribolium castaneum]
Group
Gene OntologyGO:00054882.9e-28binding
GO:00055159e-05protein binding
KEGG pathway 
InterPro domain[286-424] IPR0119902.9e-28Tetratricopeptide-like helical
Orthology groupMCL17360 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214287-TA
ATGGAACCAACTTTGGATGCGTCTTTGGTGGCGCAGTCCATAAATTATCATGGGCAGCAGCTACAAAAAACGTGGGAGGCCGAACGAGGTGAAGATGATTTATCCAAGATTGGTGTGGGGCCTTTAGACTTTGCTGTGTACCAGTCCAGGCACAAGCATTTGACTTTTCAAGACAGAGGGAAGAGACTCAAATTACATCAGTTCATCGCGAAGGAAGCATCAGCATTATTCGATGCATCTCTACTGGACGAAACTCCGTCATCTTCCAGCGTAAGCGCTGAAGCTCCTACACCAGAAGACAATCTATTTGCACTGATGCCGCCATTTGAAACATTTTTACATGTTGACAAATCAGACAGGCTGAGACATTTTTTTGATAATGTGAAAACAGGGGAGCTGATAATAGGTGCTGTTATCAACAGAACAGCATCAGGGATGATGCTTAAGGTGCTATGTACTGCTGGACCTACTTCTAGATATGTTGCTGACATCAATGTTAAGGCATTCTTACCTGTTGCTAATATCATACCGGCGGTGGACAAGAAAAATGTATCGAGAAACTACCTGATGAATGATACTGTGTGCTGTGAAGTAATTGAAGTTATTCCCGACACTGACAAAATGGTGTGCGGCATGAAGGGTGTTACTCGGAAGCCTGAAGATTCTCCACCGAAGCCTCCGCTAGGCCTCCTCAGCACTGACGACTTTCCTTTGATATACAAGAAAACTATGGAAATGAAGGGAGAGAGTTATGAAGCTATTTTGGAGAAGAGCCCAGGATTCAATAATCCTAACTGCGTCCAATATCTCTCAGAACTTCTTGGCATATCAAATATGCATTGCAGCAATTTTTCAACATTAAGGGGAGGATTTTCAGCTGCAGAATATGCTGATGAACTCCGTCAAGCTCAAGCAAGCAAGTGGGCATTCCGGTCAGTAGCTGAAGGAATTGAACACTTTAAAGCAGGAAGACATTCAGAAGCATTTCAATGTCTCAATAAAGCACTCAGCATTGATCCCCGAAACGTAGAAGGCCTTGTTGCGAGAGGTGCTCTGTATGCTAATAGTGGAACATTTAAGAAAGCCATAGAAGACTTTGAAACTTCTCTAAAACTGAACCCTAACCATGCAAATGCACGAAAATATTTGGGAGAAACGTTAGTCGCTCTCGGACGCAGCTATGAAGATGAGAACAAAATTGCTGAAGCCCAAAAAGCTTACGAGGATTGTTTGGCGATTATACCATTCCATGAGGAGGCTCAGAATTCACTAGACTTTCTGAAGAGTAAGACGTCTACCACCAAGCCATTAATAGAGCCGGCCGAGTTACTTCTACCTGGATTAACAGGAGCTAAATCGTTTGAGATGAAAGAAACACTGAAGCAATTGTTGAATCTAACAGAGAAGAAGGAAAAGAAGAAGAAGAAGAAGCGTGGGAAAGGCAAAAAGAAGCGCTCCAGTAGTTCCTCGTCGTCTTCGAGTGACTCGTCGAGCTCGAGCTCCTCGTCCGAGTCATCGTCCTCGTCAACAGAATCTAGTGGTTCAGAGGGTCCAAATAGGAAGAAGAAACGTCGCTCGCAATCAAACAACAAGCGACAGAGGTCGCTGTCGCCTCTGAGCAAGCGTATGGCTATGCTGGGAGACGCTGAGTCGGCGTCACGTACACACAACTCGCAGTTCAACCACCCGTATGGTTATCAGCCGCCGCCGCCCGCAGAAGAACCCGCCGCGCCCGGCAGGTCTCAGGCCGATATTGATTATGAATTGAAGGTACGCAAGTTCCTGGACATGACGAAAGAAGATTCTGATTATGAAGAAAAAGTTCGAAACTTCTTGGAAGAGACGGCGCAATACAAACGAAATCGAAAAATGCAAGAACTCGGTCAGCAGACACAACCGGGCGCTGAACATGATAAGAAGAAGAAGAGAAAGAAGGATAAGAAAAAGAAGAAGGAATCAAAACGCAAACGCAAAGAACAAGAGAGAGAAGAGAAAAGAAAAAATAAGATCGCTCGTATGTCAAACAGTTCCGATTATAATCTACGTGATATAGAAAATATTGGTGATAAAAAACTGAGAGATGCTATAAGAAAAGAATTGAAAGGAAAATCAAAGAGAGATCACAGTTCAGATGGTGAATATGAAAAAAAACACAATGAAAAGAGTCGCATACTTGATGAAATGCACGGACTGGAGGAGCTTGAATCCAAGCTGAGTGCGTACCACGTGATGGTGGAAAAGGAAATCGGTAAACGAGACAGATCTCTCAGTCCGCTGGACCAGGTGCCGCCGCCGCCGCTTGACAAGCCCAAGTGGAAAATGTCAATGAACGCTGTCAAAGAAACGGTCAAGAAGAAGGATACTCCAGTACAAAAAGGATACAAGGAGCGTTACGCATTTGAAGATAGCTCTGACGACTCTCAAGATCCTCGAAAGCCGTCACCATCGAGCGGCGACAAGAATGTGTCTGTTCGACGCGCAATGGCCATGTCTATGAAGGAGCCGCCGCCGCTACCGTCAGCGCCGCCACCCAAGAGCAGCCGCGAGCCCGACCCGCCTGGCACGGACCCGCCGCACACACATCAACACCCACACACGCACCCGCACCCACATCTGCACCCTCCTCACGCACCCCCGGTGCGTAAAGGTAATATAGTGCTGGACAAGTTTGGATCATTCCGATTGGCTCAAGAAGGTGAGACGCCGGTGTCTGTAGGAGACGGACGACCAGAACAGTTCGTGACCCGCATCAAGCCGCCGACGCCCTCACAACGAAGACCTCGCTCACCACCATCACCCAGGAGAAGGTCATCCAACTCCTCTGACGATAGACGCTCGGCTAAACGATCTAGAAGCCGTTCCATGCCACGGAAGTATCGTTCCCGCTCCGGATCCCGTTCCCGTTCCGGTTCCAGCGCGAGTGGCTCGGTGGCGTCTCGCCGCAGTCGCACCGTGTCGCCGAGATATCGCTCCAGATCTGATTCCTACTCACGAAGCAGATCACGCTCTCGATCGGGATCGCGCGACAGAAATCGTCGCATGAATCGTCGCGGCAATTGGCGCGGACGCGGCGGTTTCGAGCGTGGCACCTACTACCGTCCCCGTTTCCACACTTACAATGGTGGCGGGAACCGTGGTAGGGGGCGCGGTGACTTCCGCAGGGACGACGGACGTCGCTTCCAACACGAGTGGAGGGATAATCGATCTCGTGGAGGACGGCCTTTCAGACCCAGGAGAGGAGGCGGCGGACGCGGCAGACCTTTCAGGGGTGGCTTCCGTGACTTCCGCGACAGACGCGGCGGTAGATATTCCCGGTCCCGCAGCCCCGACAGAACACGCAGGTCCAGGTCATACAGCCCGGAGAGAAGAGACAAGGACAGAGACAGCTTCTCTCGTTATTCTGAACGCGACAGCCACCGTAGTGAAGGAGAGTACGAGGAGGAGCGTTACGTGGACAGGAAGGAGTATGACGGGAAGTGGGCGGACGGGAACGAGCCAGAACGAGCACACGCCGAGGAAAAGACGGAGGAACCGCCTAAAGAATAG

Protein sequence:

>DPOGS214287-PA
MEPTLDASLVAQSINYHGQQLQKTWEAERGEDDLSKIGVGPLDFAVYQSRHKHLTFQDRGKRLKLHQFIAKEASALFDASLLDETPSSSSVSAEAPTPEDNLFALMPPFETFLHVDKSDRLRHFFDNVKTGELIIGAVINRTASGMMLKVLCTAGPTSRYVADINVKAFLPVANIIPAVDKKNVSRNYLMNDTVCCEVIEVIPDTDKMVCGMKGVTRKPEDSPPKPPLGLLSTDDFPLIYKKTMEMKGESYEAILEKSPGFNNPNCVQYLSELLGISNMHCSNFSTLRGGFSAAEYADELRQAQASKWAFRSVAEGIEHFKAGRHSEAFQCLNKALSIDPRNVEGLVARGALYANSGTFKKAIEDFETSLKLNPNHANARKYLGETLVALGRSYEDENKIAEAQKAYEDCLAIIPFHEEAQNSLDFLKSKTSTTKPLIEPAELLLPGLTGAKSFEMKETLKQLLNLTEKKEKKKKKKRGKGKKKRSSSSSSSSSDSSSSSSSSESSSSSTESSGSEGPNRKKKRRSQSNNKRQRSLSPLSKRMAMLGDAESASRTHNSQFNHPYGYQPPPPAEEPAAPGRSQADIDYELKVRKFLDMTKEDSDYEEKVRNFLEETAQYKRNRKMQELGQQTQPGAEHDKKKKRKKDKKKKKESKRKRKEQEREEKRKNKIARMSNSSDYNLRDIENIGDKKLRDAIRKELKGKSKRDHSSDGEYEKKHNEKSRILDEMHGLEELESKLSAYHVMVEKEIGKRDRSLSPLDQVPPPPLDKPKWKMSMNAVKETVKKKDTPVQKGYKERYAFEDSSDDSQDPRKPSPSSGDKNVSVRRAMAMSMKEPPPLPSAPPPKSSREPDPPGTDPPHTHQHPHTHPHPHLHPPHAPPVRKGNIVLDKFGSFRLAQEGETPVSVGDGRPEQFVTRIKPPTPSQRRPRSPPSPRRRSSNSSDDRRSAKRSRSRSMPRKYRSRSGSRSRSGSSASGSVASRRSRTVSPRYRSRSDSYSRSRSRSRSGSRDRNRRMNRRGNWRGRGGFERGTYYRPRFHTYNGGGNRGRGRGDFRRDDGRRFQHEWRDNRSRGGRPFRPRRGGGGRGRPFRGGFRDFRDRRGGRYSRSRSPDRTRRSRSYSPERRDKDRDSFSRYSERDSHRSEGEYEEERYVDRKEYDGKWADGNEPERAHAEEKTEEPPKE-