Monarch geneset OGS2.0

DPOGS208637
TranscriptDPOGS208637-TA3648 bp
ProteinDPOGS208637-PA1215 aa
Genomic positionDPSCF300281 - 280047-289534
RNAseq coverage118x (Rank: top 58%)
Annotation
HeliconiusHMEL0117380.057.34% 
BombyxBGIBMGA007771-TA3e-9264.59% 
DrosophilaCG5038-PA8e-12438.10% 
EBI UniRef50UniRef50_D6WKI56e-17445.86%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WKI5_TRICA
NCBI RefSeqXP_973495.24e-17445.82%PREDICTED: similar to transmembrane and tetratricopeptide repeat containing 4 [Tribolium castaneum]
NCBI nr blastpgi|2700075542e-17345.86%hypothetical protein TcasGA2_TC014151 [Tribolium castaneum]
NCBI nr blastxgi|2700075543e-17445.86%hypothetical protein TcasGA2_TC014151 [Tribolium castaneum]
Group
Gene OntologyGO:00054882.3e-55binding
GO:00055154e-09protein binding
KEGG pathway 
InterPro domain[927-1184] IPR0119902.3e-55Tetratricopeptide-like helical
[749-825] IPR0136181.5e-28Domain of unknown function DUF1736
[1044-1077] IPR0014404e-09Tetratricopeptide TPR-1
[1044-1077] IPR0197349.9e-09Tetratricopeptide repeat
Orthology groupMCL14901 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208637-TA
ATGCCCACAGTTATTTTATTGGATGTTTCTCTTTCAATGTCTAGGCCTTTACCTAACAGTGATTCTACTGAGACTCATACTCGATTTACATTAGCGACCGCTGCTATAAACACATTTTTGGACTATTTATGTGTCCATGCTAAACTGGAATATGTAGCGTTAGTAACATTTTCATCATTATGTGAAGTTTCAGTTCCCTTTACTCGAGAATTTGACAATATTCGCGTTAAATTGCCTACGTTAGAAGAAGGTGACAAAACATGCATAGAAACTGCTCTTCATGGAGTTAACCAGCTGGTTTTAAATGAGTGGGGTTATCAGACAGCAGTTCAAATAATACTGATCACTGATGGTAGCTGTGGTGTGGGTTCTATTGGGAGGAATAGGATTATTAAAGCATTGCCACTGCCCCCAACTTACCCTGGCAAAATTCATACGATAAATTTAATTATCTCTCATTCATCTATGCCACTTTATCAGAAAATAGTTGACTTGGCCAGCAATTCCGTGAATAATACAAATGTAACGATATCCAGAGGTTCAATATATTGTCCTGATCAACTTAACATACCTGGAGTAATTGCTGCTATGACTCGTCTTTGTGAACAACACTACCAGGAGTTCTGGTGCACTCTCAAGTGTGGACAGCTGGAGACCAGGGTTCAGCTGTTCCCTGCCCCCCAGCCTGCTTCACAAGACTGTCTAGGCGCTACATACACTTTGTCCAACCAGTTGCATGTCATTGGATTTTTAACACAACAGGATTTAGGTACCCCAATAGCAATAAGCAAGCATCTTGTTATACCACAAGCTCAAGTTGCTAGTAATGCTCCACATCGTGAAAACTATGATCCTAAAACACCCACAAAGGAAAGTAGCAGTTCTGACGGCACTTCTACTGATGATGATATGTCCGACCCGAGCAAAGTTCCTAACTTCTGTGTTCTACTACATGGAGCTCTTAAGGTTGAAGGCATGTCAGCAATAGTTCAGTTGGGTGTTGACTGGTGGGGTACTTTATCGGCTTGGTGTGAGGTGTCCCGTGCTAGAAGGTCCTGTCTGCTGCTGAGTGTGATGCGTCCTAGCGCCTCCGCAGCGCCCTGGCTTGGACCCTTAGATCAATTGGGACCCTCTGAAGATAATAGCACCTCTACGGAAACATTCCCCGTTCGTTCATGGCGATCATACAGCGGCGGTTCCGGTTACGCTTGGGCGCGACCACACACTTTGTTGGCGGACGTTCAGAAAGTATTACGACATGCTAGGAAGTTACCTGATAAGACTCAACATTTGTATAAGGAATTAAACCGTCTTCGTAGAGCAGCTATTTCACTGGGCTTCTCAGAATTGCTCACATGTGTGAGCACTGCCTTAGAGCGTGAATGTACCACGCTTCCCCCCTCAGCCCCACCAGAATGTGCCTTACAGCTGGCCCACGCTGCTGCCGCCCTTAGAGACCCCAGAACCGCTTTAGACGTGAAACATAAAATTCTTTACATCACGTTCGTATCAAGTATACCTTTTATGTTTAGTTTACAAGGTGACTTTGTTTTCGATGACTCCGAAGCTATTGTTAAAAATAAAGATATCAGCTCTGATTCATGGGTACAACCCTTCTTTAATGATTTTTGGGGAACAAATATCAGGAGTAATCTTAGTCATAAATCTTACCGGCCTTTAACTATACTTACTTACAGACTGAATTATTTTTTAAGCAACAAGAATTTAACTGCAACACAATTTAAAATCACCAATCTTTTATGTCATGTTGCTTGCTGTTTATTAGTGTGGAGAACATATAGTTGCATATGGGAAAGGTTTAAAGGAAAATATGTTATGTCAAGTACACTTAATGTGCCTGTAATAGCCACTTTGATGTTCGGAGTGCATCCTATTCATGTGGAAGCAGTCTGTGGAATTGTGGGACGAGCGGACTTGTTGTCTGCATTAACATTCCTTCTATCATTCCTAATCTACGATAAATCTATAAAGACTGACAGTTATATTTATTTATTTTTAAGTTTAATAATAGCAAGTGCTTCTATGTTCTTTAAGGAAAATGGTATAACTGTTCTGGGTGTTTCTTGCATATATGATTTATTGTGCAACATAAATAAGAGAGACAATAAAAAGAAATTAAGTGATTACACATGTCTCAAAAATATACACATTAATATCAAATGTGCTTGTAGAATAATTTGTGTTGTTGCCTCCGCAATCATTTTGCTTTACATGAGATGGATAATAATGGGCAGAAATACGCCAGAATTTAAACCGACAGATAACCCAGCTGCATTTTCGGACAGTATAATCACAAAGGTAGCTACATATAATTATATATATTTCTTAAATTTCACACTACTCGTTTGGCCGCAATGGTTGTGCTACGACTGGTCGATGGGATGCGTTCCACTTATAAATAGTGTTCTAGATTTTAGAATACTGCTGCCAGTGATCTTATATATATATGCAGTATTATTTGTTAAATTTGTTATTACCAACGGAATTCATTCATTTCCACAGGCAAGATTATTAATCATGTCAGTAGTTCTTATAGCACTACCATTTCTGCCAGCTTCAAATATAGTGTATCCAGTTGGCTTTGTTATAGCTGAAAGAATATTATACATACCATCTATTGGCTATTGTTTTTTGATAGCAATTGGAGCTAATAAAATAGTTAGAAAGATCAATAGAAAGGTGGTTATTTGCGGTTTTTACGCCATGATATTAATTTATTTATTGAAAAGTTGGAATAGATCATTTGATTGGAGGAATGAATATGATTTGTTCACGAATGCACTTAATGTGTGCCCATTGAATGCAAAGGTACATTACAATGTAGCTAAAGTAGCTGATGCCAAACAAAATAACAGCTGGGCTCTGGCAGAATATAAAGAAGCAATAAGACTGTATCCCGAGTATTACCAAGCGATGAATAATTTGGCAAATTTATTGAAAAATCAAAATCAATATACTGAAGCCGAACTATATCTTAAGAATGCTCTACACTATAAACAAGAATTCCCAGCAGCCTGGATGAACCTCGGCATAGTGTTGGCCAACACCAAGCGATATGAGGAATCTGACAACGCCTACAAAACTGCTTTGAAATATCGCAAAAAATATCCAGACTGTTACTACAATTTGGGAAACTTGTACTTAGAAATGAACAAAACAAACGAGGCAATAGAAAGTTGGCACAAAGCAATCAATTTGAATCCCAAACATGTATCTGCTTGGACAAATTTACTTGCCCTTTTAGATAACACGGGGCAAACTAACAGAGCGTTACGGATAATACCACAAGCCCTCTCGGACGTGCCCGAGATGCCGTCAATTAATTTCGCCATAGCAAATATTTACGGCAAAATAAACAACTATGTTGAAGCGGAAAATTACTTCAAGAAAGCCATCAATTTGTTTGGTGACAGAGTACAAGCTATTCACTTTGCGAATCTAGGAGTACTCTACCATCGTTGGAAGAAGTATGAGTTAGCAGAAGCAATGTACAAAACGGCTTTAAAAATCGATCCCAGATTTCCTAGTGCTAAAAAGAATCTCAACACATTAAATAAATTAAAAAATAACCATTACACACTCTTTTTATTTGTTGTAACATCACCAATAAATACAAGTTTGTCTTAA

Protein sequence:

>DPOGS208637-PA
MPTVILLDVSLSMSRPLPNSDSTETHTRFTLATAAINTFLDYLCVHAKLEYVALVTFSSLCEVSVPFTREFDNIRVKLPTLEEGDKTCIETALHGVNQLVLNEWGYQTAVQIILITDGSCGVGSIGRNRIIKALPLPPTYPGKIHTINLIISHSSMPLYQKIVDLASNSVNNTNVTISRGSIYCPDQLNIPGVIAAMTRLCEQHYQEFWCTLKCGQLETRVQLFPAPQPASQDCLGATYTLSNQLHVIGFLTQQDLGTPIAISKHLVIPQAQVASNAPHRENYDPKTPTKESSSSDGTSTDDDMSDPSKVPNFCVLLHGALKVEGMSAIVQLGVDWWGTLSAWCEVSRARRSCLLLSVMRPSASAAPWLGPLDQLGPSEDNSTSTETFPVRSWRSYSGGSGYAWARPHTLLADVQKVLRHARKLPDKTQHLYKELNRLRRAAISLGFSELLTCVSTALERECTTLPPSAPPECALQLAHAAAALRDPRTALDVKHKILYITFVSSIPFMFSLQGDFVFDDSEAIVKNKDISSDSWVQPFFNDFWGTNIRSNLSHKSYRPLTILTYRLNYFLSNKNLTATQFKITNLLCHVACCLLVWRTYSCIWERFKGKYVMSSTLNVPVIATLMFGVHPIHVEAVCGIVGRADLLSALTFLLSFLIYDKSIKTDSYIYLFLSLIIASASMFFKENGITVLGVSCIYDLLCNINKRDNKKKLSDYTCLKNIHINIKCACRIICVVASAIILLYMRWIIMGRNTPEFKPTDNPAAFSDSIITKVATYNYIYFLNFTLLVWPQWLCYDWSMGCVPLINSVLDFRILLPVILYIYAVLFVKFVITNGIHSFPQARLLIMSVVLIALPFLPASNIVYPVGFVIAERILYIPSIGYCFLIAIGANKIVRKINRKVVICGFYAMILIYLLKSWNRSFDWRNEYDLFTNALNVCPLNAKVHYNVAKVADAKQNNSWALAEYKEAIRLYPEYYQAMNNLANLLKNQNQYTEAELYLKNALHYKQEFPAAWMNLGIVLANTKRYEESDNAYKTALKYRKKYPDCYYNLGNLYLEMNKTNEAIESWHKAINLNPKHVSAWTNLLALLDNTGQTNRALRIIPQALSDVPEMPSINFAIANIYGKINNYVEAENYFKKAINLFGDRVQAIHFANLGVLYHRWKKYELAEAMYKTALKIDPRFPSAKKNLNTLNKLKNNHYTLFLFVVTSPINTSLS-