Monarch geneset OGS2.0

DPOGS215446
TranscriptDPOGS215446-TA2571 bp
ProteinDPOGS215446-PA856 aa
Genomic positionDPSCF300298 + 200361-215382
RNAseq coverage850x (Rank: top 15%)
Annotation
HeliconiusHMEL0225840.073.03% 
BombyxBGIBMGA013427-TA0.064.89% 
DrosophilaAnk2-PU0.065.78% 
EBI UniRef50UniRef50_UPI0002064CAA0.069.19%UPI0002064CAA related cluster n=1 Tax=unknown RepID=UPI0002064CAA
NCBI RefSeqXP_001807645.10.070.60%PREDICTED: similar to ankyrin 2,3/unc44 [Tribolium castaneum]
NCBI nr blastpgi|1892357520.070.60%PREDICTED: similar to ankyrin 2,3/unc44 [Tribolium castaneum]
NCBI nr blastxgi|2700044880.070.01%hypothetical protein TcasGA2_TC003843 [Tribolium castaneum]
Group
Gene OntologyGO:00055159e-10protein binding
KEGG pathway 
InterPro domain[420-827] IPR0206835.9e-118Ankyrin repeat-containing domain
[748-778] IPR0021109e-10Ankyrin repeat
Orthology groupMCL10139 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215446-TA
ATGGCCGTGTGGTGCGGAGCGCGCATACTGCCTCTAGAGAGACGCGTCTACAGCATCACGCTAGTGATAGACCCCAACACAGCGTTCTTGCGCGCAGCCCGCGGCGGTCAGCTAGACACAGTCATCGACTTGTTAGACTCGGGGGCTGTTAAAGATATAAACACATGTAATTCGAATGGACTCAATGCGTTGCATCTAGCCGCTAAAGATGGTCACATTTCAGTGGTAGAGGAACTCTTGAAGCGCGGTGCTACAGTTGACGCAGCTACAAAGAAAGGCAATACAGCTCTCCACATAGCATGTCTCGCGGGGCAGGAGTCGGTAGCGAGAGCGCTTCTCGGCGCAGGGGCGAAGGCCGATGCTCAGTCTGCTGCAGGTTTCACACCATTATATATGGCGGCTCAGGAAAACCATGCTGGATGTGTCAAGATGTTGCTCGCCGCTGGAGCCAGTCAGACCCTCGCGACAGAGGATGGCTTCACACCTTTAGCGGTGGCGATGCAACAAGGACATGACAGAGTGGTCGCTGAACTGCTAGAATCAGACACTCGCGGTAAAGTAAGACTACCCGCGCTACATATCGCAGCTAAGAAAAACGACGTTAAGGCAGCTACATTGTTACTCGAAAACGAACACAACCCTGATGCTTGTTCCAAATCTGGCTTCACACCCCTGCATATTGCAGCACACTACGGCAACGTAGGGGTGGCGAAGGCTTTGTTATCATCAGGGGCGGACCCTGGCAGGGCAGCTAAACATAACATCACCCCTCTACATGTAGCCAGCAAGTGGGGGCAGCTGGCTATGGTAGACCTTCTTGTTGAAAACGGAGGAAATATAGCAGCAATGACGCGAGATGGATTGACGCCTCTACATTGTGCAGCTCGTTCTGGTCATAGCAATGTGGTATCTAGACTTCTACAGCATGGAGCGCCCATCACAAGCAAGACAAAGAACGGTCTTACCCCGTTGCATATGTCGGTTCAGGGTGAGCATGTTGAGACTGCACGTGCTTTACTATCAGAGGGCGCGCCCATCGATGACGTCACTGTAGACTATCTCACCGCTTTGCACGTGGCCGCACACTGCGGACATGTCAAGGTTGCGAAATTACTTTTGGATAGAAATGCAGATGCGAATGCCAGAGCTCTAAACGGCTTCACACCACTACACATTGCATGCAAGAAGAACAGACTCAAAGTTGTTGAACTGTTACTCAAATACGGAGCGAGTAAATCAGCGACAACCGAATCAGGGTTGACACCACTGCATGTCGCTTCGTTCATGGGTTGTATGAACATTGCGCTAGTCCTGGTGGGGGCGGGCGCCTCCGCTGACGCTGCCACAGCCCGTGGAGAGACACCCCTGCATCTTGCGGCACGAGCTCATCAGACGGACTTAGTTAGAGTGCTGCTTAGGAACAACGCTAAGGTTGAAGCTCGCGCTCGTGAAGAACAGACGCCATTGCACGTGGCAGCTCGACTCGGCCACGCGGACATCGCGGGACTTCTCATACAACACGGAGCTGACGTGGCCGCCAATACTAAGGACAAGTACACACCGCTACATATCGCAGCTAAAGAGGGTAAAGAAGAAGTAGCCTCAATCCTCCTAGACAACAACGCGCCTATAGAGGCGGAAACCAGAAAAGGCTTCACTCCACTCCACTTGGCAGCCAAGTACGGTGATATTGGAGTGGCCAGGCTGTTGTTAGCGAGAGGCGCTCAGCCGGACGCGCCCGGGAAAAGTCATATCACACCATTACATATGGCCACATACTATGGACATCCGGACATCGCATTGCTTTTACTGGATAAAGGTGCCTCTCCACACGCGTTGGCCAAAAACGGTCATAGCGCCCTCCACATCGCGTGTCGTCATAACCATCCAGATATTGCGTTCGCATTGCTTGAACACGATGCGGATCCCTCAGTGAAATCTAAAGCTGGTTTCACACCACTACACATGGCGGCTCAAGAGGGACACGAGGACTGTGTGGAGATGCTCATAGAGAGAGGAGCGGATATAAACGTACCAGCTAACAATGGTCTAACTCCGCTACACCTGGCAGCGGCTGAAGGCCGTACAGCTGTATTGAAGTCCCTCCTGTCAGCTGGTGGGCGATGTGCTGCACGGACCAGGGACGGATACACCCCCCTCCACGCCGCCGCCCATCATGGGCACCACGCGGCCGCACGAGCGCTCATAGAGGGCGGGGCTGACGTCACAGCTAGAGCCGCCCACGGATTCACCCCCCTCCACCAGGCGGCTCAGCAAGGACACACCCTCATCATACAACTGTTGCTTAAGAACAACGCGGATCCTAACGCCTTATCAGCTAGCGGTCACACTGCGTGCGCGTTGGCCGACCGCCTCGGCTACATCAGCGCGGTTGAAGCTCTAAGGCCGCTCACGCAACACACGCTGTCACAGGCCGTGGGAGACTCGGGTACGTTAGTGTGTTGTACGCCTTATTTAGTATTATGGATTCCATACATTTCTGATACATTCATAGCAGTTTGGATCATCGATTTTAATATGTACCAGAATTATAATTCCTTCATAGATTAG

Protein sequence:

>DPOGS215446-PA
MAVWCGARILPLERRVYSITLVIDPNTAFLRAARGGQLDTVIDLLDSGAVKDINTCNSNGLNALHLAAKDGHISVVEELLKRGATVDAATKKGNTALHIACLAGQESVARALLGAGAKADAQSAAGFTPLYMAAQENHAGCVKMLLAAGASQTLATEDGFTPLAVAMQQGHDRVVAELLESDTRGKVRLPALHIAAKKNDVKAATLLLENEHNPDACSKSGFTPLHIAAHYGNVGVAKALLSSGADPGRAAKHNITPLHVASKWGQLAMVDLLVENGGNIAAMTRDGLTPLHCAARSGHSNVVSRLLQHGAPITSKTKNGLTPLHMSVQGEHVETARALLSEGAPIDDVTVDYLTALHVAAHCGHVKVAKLLLDRNADANARALNGFTPLHIACKKNRLKVVELLLKYGASKSATTESGLTPLHVASFMGCMNIALVLVGAGASADAATARGETPLHLAARAHQTDLVRVLLRNNAKVEARAREEQTPLHVAARLGHADIAGLLIQHGADVAANTKDKYTPLHIAAKEGKEEVASILLDNNAPIEAETRKGFTPLHLAAKYGDIGVARLLLARGAQPDAPGKSHITPLHMATYYGHPDIALLLLDKGASPHALAKNGHSALHIACRHNHPDIAFALLEHDADPSVKSKAGFTPLHMAAQEGHEDCVEMLIERGADINVPANNGLTPLHLAAAEGRTAVLKSLLSAGGRCAARTRDGYTPLHAAAHHGHHAAARALIEGGADVTARAAHGFTPLHQAAQQGHTLIIQLLLKNNADPNALSASGHTACALADRLGYISAVEALRPLTQHTLSQAVGDSGTLVCCTPYLVLWIPYISDTFIAVWIIDFNMYQNYNSFID-