Monarch geneset OGS2.0

DPOGS210801
Transcript: DPOGS210801-TA (3147 bp)
Protein: DPOGS210801-PA (1048 aa)
Genomic position: DPSCF300027 - 876607-891499
RNAseq coverage: 173x (Rank: top 50%)

Annotation
Heliconius      HMEL005656        0.0     81.88%
Bombyx          BGIBMGA007117-TA  0.0     74.84%
Drosophila      CG6954-PA         4e-157  51.57%
EBI UniRef50    UniRef50_D6WUP7   5e-167  57.17%  Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WUP7_TRICA
NCBI RefSeq     XP_972600.2       1e-167  57.17%  PREDICTED: similar to CG6954 CG6954-PA [Tribolium castaneum]
NCBI nr blastp  gi|270011370      2e-166  57.17%  hypothetical protein TcasGA2_TC005383 [Tribolium castaneum]
NCBI nr blastx  gi|270011370      5e-168  45.65%  hypothetical protein TcasGA2_TC005383 [Tribolium castaneum]

Group:
Gene Ontology: GO:0007165  3.1e-07  signal transduction
KEGG pathway:
InterPro domain:
[215-276]   IPR020683  8.5e-10  Ankyrin repeat-containing domain
[356-464]   IPR008957  4.3e-09  Fibronectin type III domain
[932-1033]  IPR000159  3.1e-07  Ras-association
Orthology group: MCL13392 (Single-copy universal gene)

Nucleotide sequence:

>DPOGS210801-TA
ATGGAGACCCTCCTCGATAAATTCAAGAGGGACCAAGGAGGATTACGGAGGTCCAGATCCGTCCGCGCTTCCCTACGACTGATAGGTAACCGTTGGCGATCAACCAAGGACGAAGAAACGTCTGAGAACATTGACACCATCAAAGACGGAAACGTAGTACTGACGCAGATCTTTAACGGTGATAAAGATTACAACGTAGCGTACACGGGCATTGACGGTACTTACAAAGCAAAGACACCGAACATGGCCAAGAGGAAGCAGGAGCCGGTGAAGCCGAGGAAGAAGAGCGGAGACATCGATCTCACACTGAAGCCACAGCGGCATAGGAAAGTAGATAAGGCGAAAATAAGTGATTTATTCAATAAGAAACAGAAGAGTGTGTCCGCGAAAGAATTGCTCGTGCCGGAGAAGATACCACCGAAAGCGGCGGCCATATTACACATAAATTCACCGGATACCAAACTTAAGATTAAATTAAAATCGGTCGGCAAATCCGAGAGTGCTAAATCGGTGACCGATATAAAACATCGTCGCACGCGCAGGGGCTCCGAGAGCGATATGTGTGGGAGCCAGCCACGTGGGTTTGTCAACCAAGCCTTCGTATACAGCACACCGCCAAAGGAACGCAAGCTAAACCTATCACCATCAAGCTATTTGACTCTCTTCGCTGCCGTTGAACATGGCTACCTCGATAAGGCGAGAAACATCCTGGAGTCTACTGATGTCGATGTTAACAGCCTCAACCCAGACGGCCTGTCTCCGCTGGACGTGGCGGTGTTGGCCAACAACCGCCATCTGGTTAGAATGCTCATGGAGTTTGGAGCCAAGGAAGGCAGTCAATTCAAGAGCCCAGAAGCCCTCGGGAACCATCTCCGCAGACTGTCTCGCGAGGCGGAAGCCAGACTCCACGAGGTCACGGGGTACATACCTGAGGCTTGTAGAGAGGAAATGTGCAGATCATCGGGTGGCAACCAGGCCGGTACGATGGGTTGCGCTGGAGGTACCGGCTCGGAGAAGGACAAACAAACACAACACTGGGAGAGGCGGCTTAGAACATTGAAACGATTGGTGCAAGGTTGGCAGCAAGCTCGTGTCCCTCCAGCCTTGCCCTCACCCTCGTTGGAGGTCTGCGGACCGCACTCCGTCACCGTGTGGCTAGCGACACGGACCCAGGCACGCGACCCGCCGCTCGTTACCAAATACAAAGTGGAGTGGTCGTCTCGTGCGGATTTCTCAAATGTGTGCGGAAGTCGCGAGGTGGTGGCGTCCTGTGCTCGCGTGTCCGTGACGGGTCTGACCCGCGGCAGGCGGTACTTCTTCAGGGCTGCGGCCGGGAACGTGAAGGGCTGGGGCACTTACAGTGTGTCAGTACCTAGGAGCGTTGTACCCAGCAGCTGGAGTGATATATCAGTTCGTTCGTCTCGCGGGGAGGCTGGTGGCGCTGCGGCAGCGCTGGAAGCGCTCGCTGCGGCCACCGCAAGCACTCGAACCAGAACAGTCACAAGACTTCCTAGGAAGAAGACGGCCACTATACGACAACTGTTCACAGCGGCCAGCAAGTTCCAGAAGAACTTGAGGAGAGGCGTGTACTTAACTTGCATCATCTACCACGAGGACCGCGTCTTGGTCACCGGCGAGGAGTTCCTTCCGGTCGTTGAAGTGGATGAGACATACCCGGCCTGCATCTACACGGACTTCCACTGGCTGATGAAGGTGTCCTGTTCCTGGGAAGAAGTGAAGTCTCTGAGATCTGATATGGAGAAACACACGTCGTCGTCAATACACTTCAGACTGAAACTCCTCACAGCCGTCGCTCAGATGCAATCCGCGCTGGGGATACAAGACCTTGGCCAACTCTACCACAAACCGTTACGAGATTCTCACGGCACTGTCGTCCTCTCCTGTATAACCTCCGTGAAGTCACCCAAGGCCGTGTCGGCTTTGAACTCCCGCTGGCTGCCCGTCAACAAGCTGAGGCGACGCGTGCTCAGCGACGACAA
CACCATGGGAGAACTGCTCATGGCATCAGTTCACGAACAGATTGCCTACCATCAGGTGTCACGTGACCTGCTGCCGCGTGGTCTGTATCTCGGGTATTTGAAGTTGCAATCGTCGGTGGAGGTGGTCCGCATAGTGGCGCCATCTCGCACACCGAATGTACCACCACACACACGAGTCAGAGATAATCCTCATGTTTCCGCTGAGGAATGGGAGTACCTCAAGTCACAATCCGGCCGTTCTAACAACTACATGGGCAGCAACTCCCAGTTCGATATCAAATCTAACGGATCCCACGCTTCGATATCCAGCAACCGTCTGCCGCCATCGCGCAGCGAAGACACGCTGGTGCTACAAAACGAGAACATAGAACCCAAGCCAAGGCCGCACAGCATCAACACGAGTGTCTCAACCAGCTCCAGTCCGCTACTGACAGTGAAAGGCTTCTATCCCGGGAGCATGATCAGTGTGAAAACAAACAAGACGAACGTGTCGGAGTCCGCTCAGAGTCTGTCCAGCGATTCAGAAAGTCAAAGCTGTCCGCTCACAACGACAGTTCCTCTAAAAATCCGTCCGCAACTCACGCACGCAAACGTGACAGCTTCAAAAAGTATGACCAACGTGAAGTCGACGGACGTCGAATTCACTGACACGGGACAGGGGAGTAAGGGTCACAGTGTCAAACGTCAGCCACTGTTGGAGACACAGGAAGAAGAATATAATAATAAGGTCCAGAAGACTGACCAGAGACACAGTGACGATGACAGGAAACTACCTCCAGAACAACGAATTGATCAAGCTGGAATTTTACAGGTGTTCGCAGCCTACGAGACGGGCTTGGCTGTGGGTACATCACTCAAGTTGCACGTCACTCCACGAACATCAGCACGAGAGGTCATTGACCTCGTCGTCAAACAACTCAACATGGCCGCCGTTTTAAAAGGAAAATCCGGACCTGTCTACGGTCCGGAAAAGCTTCAAGATTTTTGCCTCGTAGCAGTTATAGGAGCCAGAGAGAGATGTCTACGAGACGACTTCAGACCTCTACAATTGCAGAACCCTTGGAGGAAAGGAAGATTATACGTCAGATTGAAACACGACGTGCTGGCTGCATTACAACATTCAGCCAAACAGCCCGCTTATATATAA

Protein sequence:

>DPOGS210801-PA
METLLDKFKRDQGGLRRSRSVRASLRLIGNRWRSTKDEETSENIDTIKDGNVVLTQIFNGDKDYNVAYTGIDGTYKAKTPNMAKRKQEPVKPRKKSGDIDLTLKPQRHRKVDKAKISDLFNKKQKSVSAKELLVPEKIPPKAAAILHINSPDTKLKIKLKSVGKSESAKSVTDIKHRRTRRGSESDMCGSQPRGFVNQAFVYSTPPKERKLNLSPSSYLTLFAAVEHGYLDKARNILESTDVDVNSLNPDGLSPLDVAVLANNRHLVRMLMEFGAKEGSQFKSPEALGNHLRRLSREAEARLHEVTGYIPEACREEMCRSSGGNQAGTMGCAGGTGSEKDKQTQHWERRLRTLKRLVQGWQQARVPPALPSPSLEVCGPHSVTVWLATRTQARDPPLVTKYKVEWSSRADFSNVCGSREVVASCARVSVTGLTRGRRYFFRAAAGNVKGWGTYSVSVPRSVVPSSWSDISVRSSRGEAGGAAAALEALAAATASTRTRTVTRLPRKKTATIRQLFTAASKFQKNLRRGVYLTCIIYHEDRVLVTGEEFLPVVEVDETYPACIYTDFHWLMKVSCSWEEVKSLRSDMEKHTSSSIHFRLKLLTAVAQMQSALGIQDLGQLYHKPLRDSHGTVVLSCITSVKSPKAVSALNSRWLPVNKLRRRVLSDDNTMGELLMASVHEQIAYHQVSRDLLPRGLYLGYLKLQSSVEVVRIVAPSRTPNVPPHTRVRDNPHVSAEEWEYLKSQSGRSNNYMGSNSQFDIKSNGSHASISSNRLPPSRSEDTLVLQNENIEPKPRPHSINTSVSTSSSPLLTVKGFYPGSMISVKTNKTNVSESAQSLSSDSESQSCPLTTTVPLKIRPQLTHANVTASKSMTNVKSTDVEFTDTGQGSKGHSVKRQPLLETQEEEYNNKVQKTDQRHSDDDRKLPPEQRIDQAGILQVFAAYETGLAVGTSLKLHVTPRTSAREVIDLVVKQLNMAAVLKGKSGPVYGPEKLQDFCLVAVIGARERCLRDDFRPLQLQNPWRKGRLYVRLKHDVLAALQHSAKQPAYI-
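
The reported lengths (3147 bp transcript, 1048 aa protein) agree if the transcript is read as a complete coding sequence with one stop codon: 3 x 1048 + 3 = 3147. Below is a minimal Python sketch of that check. It assumes the two FASTA records above have been saved as text, that the transcript contains no untranslated regions, and that the trailing "-" on the protein marks the stop. The helper names read_fasta and lengths_consistent are hypothetical and not part of any database tooling.

```python
def read_fasta(text):
    """Parse FASTA text into {header: sequence}, joining wrapped lines."""
    records = {}
    header = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            header = line[1:]
            records[header] = ""
        elif header is not None:
            records[header] += line
    return records


def lengths_consistent(transcript, protein):
    """True if the transcript length equals 3 * residues + 3 (one stop codon).

    Assumption: a trailing '-' or '*' on the protein marks the stop
    and is not counted as a residue.
    """
    residues = protein.rstrip("-*")
    return len(transcript) == 3 * len(residues) + 3


# Lengths as reported in this record: 3147 bp and 1048 aa.
assert 3 * 1048 + 3 == 3147
```

With the records loaded through read_fasta, passing the DPOGS210801-TA and DPOGS210801-PA sequences to lengths_consistent gives a quick check that neither sequence was truncated when copied.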