Monarch geneset OGS2.0

DPOGS209892
TranscriptDPOGS209892-TA2109 bp
ProteinDPOGS209892-PA702 aa
Genomic positionDPSCF300049 - 396935-401449
RNAseq coverage580x (Rank: top 22%)
Annotation
HeliconiusHMEL0067520.071.29% 
Bombyx% 
Drosophilacactin-PA0.055.43% 
EBI UniRef50UniRef50_Q9VR990.055.43%Cactin n=12 Tax=Drosophila RepID=Q9VR99_DROME
NCBI RefSeqXP_975642.10.057.91%PREDICTED: similar to cactin CG1676-PA [Tribolium castaneum]
NCBI nr blastpgi|910873750.057.91%PREDICTED: similar to cactin CG1676-PA [Tribolium castaneum]
NCBI nr blastxgi|3838494490.057.18%PREDICTED: uncharacterized protein C19orf29-like [Megachile rotundata]
Group
KEGG pathway 
InterPro domain[578-702] IPR0191341.4e-57Cactin protein, cactus-binding domain, C-terminal
[240-421] IPR0188162e-53Cactin, domain
Orthology groupMCL15295 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209892-TA
ATGTCGTCACGACATATAAGTAAACACAGAGATATCTCTCGCGGTCGTTCAGTTGAGCATCATCGAGATTGTGGAGGACGAAGTGAAGACAAAATCTCTGAAAAAAGGTCAAGATCACCTAACAGAAAACGGAAGTCCGCCAGCAAAGATAGACGTAAGTCACCACAAAAAAAACACACGTCTTCTAGTAGTGAAAGAAAGAAAGAATCAAAGAAGAAGAAAAGTAAAAAGAAAAAAAAGTCACGTAATTCATCAAGTAGCAGCAGCAGCAGCAGCAGTAGCAGTAGTGACAGTGACGAGGAAGAGTTAAAACTACTACAGAGACTTGAAGCAGAGAGGTTGAGGCTGAAAGAAGAAAAAAGAAAGCAGAAAGAAATGATAAAAGCAAATGAAACACCCGAGGAGAAAAGAGCTCGACGTCTAAAAGAAAAGCAAGAAAAGGAAAGGAAGAGACGTGAGCGTATGGGGTGGGACAACGAGTATCAGTGTTACACCGATCAAGATAACCCCTTCGGAGACTCTGCTCTCACTGACACATTTGTATGGACAAAGAAACTGGCTAAGGAAGGTGTTAAGAATGTCTCTCACAACGAACTGGAGGCATTGAACAGGCAGAAACAATTAGAAAATAAAATTGAATTGGAGAAGGTGAAGCAGCGTCGACTGGAGCGCGAGGCTGAGCGTGCGGCACGCGAGGCCGAGGCGGCGGCGGCGGCCCGGGCCCGGGAGGCCGCGCAGTTCAGCAGCTGGGCTCGACACGAAGACGAGTTCCATCTGCAGCAGGCCCGGCTGCGCTCGCAGATACGGATACGAGACGGAAGAGCTAAGCCGATAGACCTCCTCGCGTGGTACGTGAGTTCCGAGCAGTGTGTCGATGCGCTCGAAATGCACGAGCCGTACACGTACCTGAACGGCCTGCACGCACAGGACCTGGAGGACTTACTGGAGGATATCAAGGTGTACAAGGAGCTGGAGCAGGACGTGAACCAATCGTACTGGGAGGACGTGCAGACTATCGTGTCGTCGGAGCTGGCGAAGCTGCGGCGCCTGGCTCCGGGCCGGGACGGGGTGCACGCCGCCGTGGCCGAAGACGTGGCCGGCGTGTTCCGCGGGAAGAGCACGGCCGCCCTGCTTCAGTTGCAGGACGCCATCGAACACAAGATGGCCGCCAGGACCGCCGGGATCGACGTGCACTACTGGGAGAGTCTGCTCAGCCAGCTCAAAGCTCACATGGCACGAGCTCGTCTCCGAGACCGGCACCAGAACAACCTCCGCCGCAAGCTGCAGCTGCTGAAGAGGGAACAAGGAGTCGCCGCGGACGAGCACGCGGAACACGAGGACAAACACACACACGGAGAGGGCGCTGGTCCGGAGCAGAAGTCGCCTCGGACGGAGAGCGAGGCGGAGGAGGCGGAGGCGGAGGGCGAGTCGTGGTGCGGAAGTTACTCCCCGCGGTACCTGGCGCCCGCCTCGCTGGAGCCCGCCACGCTGCTGCTGGAGCCCCACGAGGACCGCCAGCGCCTCGCCTTCCTCCGAGCCAGGCTGCATGCCGCCGCCGCCGCCGACCAGCACAAGGCCACGCTCGCTAAGCTTCCGGAGGCAGCTGATGCAGTGCCGGGCACCAGCACGGGCGCTCTGGAGGCGGCCGCGAGGCGCTCCATGGAGGGAGGCAGTGAGGGCGGCGCCGCACAGTTCAGTGTGGAGCACGTGCTGCCCGACCAGCCTTGCTTGTGGGCGGACAAGTACAGACCCAGGAAACCAAGATACTTCAACAGAGTCCACACCGGCTTCGAGTGGAACAAATACAACCAGACTCATTACGACATGGACAACCCTCCGCCGAAGATCGTTCAAGGATACAAGTTCAACATCTTCTACCCGGACCTCATCGACAAGAGCGCCACCCCTGAGTTCTCACTTAAGCCGTGTGCTGACAACCCTGAGTTTGCTGTGCTTCGTTTCCATGCGGGCCCACCCTATGAAGACATCGCCTTCAAGATAGTGAACCGTGAGTGGGAGTACTCCTACAAGAGAGGCTTCCGCTGTCACTTCCACAACAACATCTTCCAGTTGTGGTTCCACTTCAAGAGATACAGATACAGGCGTTGA

Protein sequence:

>DPOGS209892-PA
MSSRHISKHRDISRGRSVEHHRDCGGRSEDKISEKRSRSPNRKRKSASKDRRKSPQKKHTSSSSERKKESKKKKSKKKKKSRNSSSSSSSSSSSSSDSDEEELKLLQRLEAERLRLKEEKRKQKEMIKANETPEEKRARRLKEKQEKERKRRERMGWDNEYQCYTDQDNPFGDSALTDTFVWTKKLAKEGVKNVSHNELEALNRQKQLENKIELEKVKQRRLEREAERAAREAEAAAAARAREAAQFSSWARHEDEFHLQQARLRSQIRIRDGRAKPIDLLAWYVSSEQCVDALEMHEPYTYLNGLHAQDLEDLLEDIKVYKELEQDVNQSYWEDVQTIVSSELAKLRRLAPGRDGVHAAVAEDVAGVFRGKSTAALLQLQDAIEHKMAARTAGIDVHYWESLLSQLKAHMARARLRDRHQNNLRRKLQLLKREQGVAADEHAEHEDKHTHGEGAGPEQKSPRTESEAEEAEAEGESWCGSYSPRYLAPASLEPATLLLEPHEDRQRLAFLRARLHAAAAADQHKATLAKLPEAADAVPGTSTGALEAAARRSMEGGSEGGAAQFSVEHVLPDQPCLWADKYRPRKPRYFNRVHTGFEWNKYNQTHYDMDNPPPKIVQGYKFNIFYPDLIDKSATPEFSLKPCADNPEFAVLRFHAGPPYEDIAFKIVNREWEYSYKRGFRCHFHNNIFQLWFHFKRYRYRR-