Monarch geneset OGS2.0

DPOGS209132
Transcript        DPOGS209132-TA   1542 bp
Protein           DPOGS209132-PA   513 aa
Genomic position  DPSCF300061 - 1188091-1192406
RNAseq coverage   79x (Rank: top 64%)
Annotation
Heliconius      HMEL008147         3e-70   72.73%
Bombyx          BGIBMGA001843-TA   5e-46   47.73%
Drosophila      BBS4-PA            2e-29   26.29%
EBI UniRef50    UniRef50_D6WTY1    4e-57   32.64%   Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WTY1_TRICA
NCBI RefSeq     XP_624286.2        2e-42   28.99%   PREDICTED: similar to Bardet-Biedl syndrome 4 [Apis mellifera]
NCBI nr blastp  gi|270010863       1e-56   32.64%   hypothetical protein TcasGA2_TC015903 [Tribolium castaneum]
NCBI nr blastx  gi|270010863       6e-56   32.71%   hypothetical protein TcasGA2_TC015903 [Tribolium castaneum]
Group
Gene Ontology    GO:0005488   9.3e-26   binding
KEGG pathway
InterPro domain  [158-397]   IPR011990   9.3e-26   Tetratricopeptide-like helical
Orthology group  MCL14367   Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209132-TA
ATGCGAAAACAGTTATCTGTTGAGAAATTGTCGACAAACGATTTTAAGAAAAATTCTTTCAACTGGCTATTATACGCGCGGTATGTTCGCGGCGAGAACACTTTGTGCCTGGAGCTGGTGGACCGTCTGCAGCAGGATGCAGATAACAAACACAGATACGCACATTATATTAAGGGTGCGATGTTAGCCGACGCGGGGAGGTTACAGGAGGCGGTGGAAAAGTATCACAGCTGTATCAGATTGCAGCCCCAAGATCCGGATCCTTTAAAACAGGTTGCGAAATGTTTGTATAAACAAGGCAGAACGCAATTAGCGCTCGAAGCTTATTTGGAGGCTGAAAAAAAATCGAAACATCCGGATCCCGACGTTTACTGCGGATTGGCGTCATGTGCGGCGTCCATCAGCGACGCTCGCGACGTGTCGTGGGCCCGCGCGGCCCTGTCCGCGGGCGGAGGAGAGCGAGCCGCCTCGCTGTTGGCCGCACGACTGTTGGCTCGCGGAGACACGAAGGACGCACTCGCGGTCTACGAACACGCTGTCAGTGAGTACTCGTGCGGGGCAGACACGTTGTCTGCGGCGGGAGCGTTGTGCCTGCGGGCGGGACTTCCCCGCCGGGCGTTCCAGTTACTAGGCGAGGCGCTGTCTCAGCAGCCGACCCAGCACGCCGCGGCGCTGGCGCTGGCCGCCATGCTGCTGCAGCACCGTGACGAGGACGCCGCCCTCGCCAGGCTCAAGGCCGCCCTCACCGCGCACCCGGATTGTGTCGCCGCACACTCCGACCTCGGCCTGGCTCTGTTCTCCAAGAAGAAGTTCATAGCGGCTCTGTGGTGTTGTCTCCGCGCGTCGTGGGCCGCGCCACTGAGCGCGGCGGCGGCTCACAACGCAGGCCTGGTGTTGCTCGCGAGCCGCCGCCCAGCCTCCGCCTTCTGCCGCCTCGCCTCCGCCGCCGCCCTAAATCCTCGAGACGCCTACACTGTTCTACTGATAGCGTTGTCGCTGGAGCGTGTGGGAGACGGTCGCGCGGACGCCGCGTTCGATCGCGCGTGTGAACTGGCCCCGCATGACGCGCTCGTGAGGATCAACTGCGGCGGGCGACACGCGCGTGTGGGACGACTGGAACACGCCGCTAGAGAGGCTGGCGTCGTAGCAAGGTTGCTAGAAGACCAACCGGATGCTCATTTGGCGAGCTCATTGGCCACATTGATGGCATTACTTACTGAAGCAGGAATTACTATACAAGTGACGGAACCTGATGAGACACACACGGCCAGAGAGGAACTGGCTTCAGACGAAGTATTCAGTGCAGATGAACGTCAAGCTGGCCTCATTATATTGCCTCCGATAATTTTAGGCTACGACGATCTAAACGATAAACCAGCACCAGAAGTAAAACTGCATCAATCAGTTGAAGAAAATAATATTGACAACGAAAAGAGCAGCGTCAAGAAACATGACTGTAACTATAACTTGAACGGCAGAGATGTCACATTCGTTGGAAAAATAATGGCGCGAAACAAAAAGTATCCCTGCCGCAATTCATAA

Protein sequence:

>DPOGS209132-PA
MRKQLSVEKLSTNDFKKNSFNWLLYARYVRGENTLCLELVDRLQQDADNKHRYAHYIKGAMLADAGRLQEAVEKYHSCIRLQPQDPDPLKQVAKCLYKQGRTQLALEAYLEAEKKSKHPDPDVYCGLASCAASISDARDVSWARAALSAGGGERAASLLAARLLARGDTKDALAVYEHAVSEYSCGADTLSAAGALCLRAGLPRRAFQLLGEALSQQPTQHAAALALAAMLLQHRDEDAALARLKAALTAHPDCVAAHSDLGLALFSKKKFIAALWCCLRASWAAPLSAAAAHNAGLVLLASRRPASAFCRLASAAALNPRDAYTVLLIALSLERVGDGRADAAFDRACELAPHDALVRINCGGRHARVGRLEHAAREAGVVARLLEDQPDAHLASSLATLMALLTEAGITIQVTEPDETHTAREELASDEVFSADERQAGLIILPPIILGYDDLNDKPAPEVKLHQSVEENNIDNEKSSVKKHDCNYNLNGRDVTFVGKIMARNKKYPCRNS-
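
The transcript and protein lengths listed in this record (1542 bp and 513 aa) can be checked against each other. The sketch below is a minimal illustration, not part of the database. It assumes the transcript is a complete coding sequence whose final codon is a stop codon, which is consistent with the trailing "-" in the protein sequence. The function names are hypothetical and the standard stop codons (TAA, TAG, TGA) are assumed.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # assumption: standard genetic code


def cds_matches_protein(cds_bp: int, protein_aa: int, has_stop: bool = True) -> bool:
    """Return True if a CDS of cds_bp nucleotides can encode protein_aa residues."""
    if cds_bp % 3 != 0:
        return False
    codons = cds_bp // 3
    # A complete CDS carries one extra codon for the stop signal.
    expected_codons = protein_aa + (1 if has_stop else 0)
    return codons == expected_codons


def looks_like_complete_cds(seq: str) -> bool:
    """Return True if seq starts with ATG, ends with a stop codon, and is in frame."""
    seq = seq.upper()
    return (
        len(seq) >= 6
        and len(seq) % 3 == 0
        and seq.startswith("ATG")
        and seq[-3:] in STOP_CODONS
    )


# Lengths taken from the record above: 1542 bp transcript, 513 aa protein.
print(cds_matches_protein(1542, 513))
```

With the values from this record, 1542 / 3 gives 514 codons, that is 513 residues plus one stop codon. To run the second check on the real data, pass the DPOGS209132-TA sequence to `looks_like_complete_cds` as a single string.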