Monarch geneset OGS2.0

DPOGS201048
Transcript: DPOGS201048-TA  2940 bp
Protein: DPOGS201048-PA  979 aa
Genomic position: DPSCF300299 + 135912-154020
RNAseq coverage: 107x (Rank: top 60%)
Annotation
Heliconius: HMEL005361  1e-101  78.45%
Bombyx: BGIBMGA012489-TA  0.0  87.87%
Drosophila: coro-PB  4e-111  47.38%
EBI UniRef50: UniRef50_F4WLT3  0.0  69.02%  Coronin-2A n=13 Tax=Coelomata RepID=F4WLT3_ACREC
NCBI RefSeq: XP_001606043.1  0.0  69.67%  PREDICTED: similar to ENSANGP00000029333 [Nasonia vitripennis]
NCBI nr blastp: gi|383856889  0.0  71.07%  PREDICTED: coronin-2B-like isoform 1 [Megachile rotundata]
NCBI nr blastx: gi|383856889  0.0  70.59%  PREDICTED: coronin-2B-like isoform 1 [Megachile rotundata]
Group
Gene Ontology: GO:0005515  7.3e-56  protein binding
KEGG pathway 
InterPro domain: [41-973]  IPR015505  1.2e-255  Coronin
                 [102-733]  IPR011046  7.3e-56  WD40 repeat-like-containing domain
                 [257-393]  IPR015049  1.2e-54  Domain of unknown function DUF1900
                 [507-738]  IPR015943  2.3e-38  WD40/YVTN repeat-like-containing domain
                 [45-106]  IPR015048  1.3e-31  Domain of unknown function DUF1899
                 [513-545]  IPR019781  8.1e-10  WD40 repeat, subgroup
                 [502-545]  IPR001680  3.2e-07  WD40 repeat
Orthology group: MCL14602  Patchy

Nucleotide sequence:

>DPOGS201048-TA
ATGCCTCCTGTAAAAGATGATCTGATTTTTAACTGTGTAGAGGTAAACATACCTGAAAAAAGCCCAGGAAAAGGTACAGCATTAGTAGATGTGAGGGCCTGTAATAAACCTGCCGCTAAGGTATGGTTCCGTGGAGTTCGTAGCTCGAAGTTTCGTCACGTGTACGGTGTGCCGTTCAAACGTGAGAGATGCTATGATAATATAAAAATAACGAGAAACGCCCACGATTCCAACTTCTGTGCCGTGAACCCCAAGTTTGTGGCTATCGTCACCGAAGTTGCAGGCGGGGGAGCCTTCCTAGTACTGCCTTTGGATCACGTGAAGATATGGCACATCCCTGATGGTGGCCTATCTATGCACCTCACCGACTGGCTGGTTGAGCTCCACGGGCACAAGAGGCGTGTGGCCTACATAGAGTGGCATCCCACGGCTGAGAACATACTGTTTAGTGCTGGATTCGATTATCTGATCTTTGTATGGGATGTGGGCAAGGGCGAGGCTGTTAAGGTCATCGATTGCCACAGTGACGTCATCTATTGCATGTCCTTCAATCGTGACGGATCACTGTTAGCGACCACCTGCAAGGACAAGAAACTACGGGTTATAGAGCCTAGGCGAGGGATCGTGCTGTCTGAGGGGCCCTGTCACCTCGGCACCAAGGCTTCCAAGTGCACGTTCCTTGGCGCTCAGTGCAAAGTATTGACAACTGGTTTCTCGCGACACAGCGACCGTCAGTACGCCGTTTGGGACCAACACGACGTGAGCGAGCCCCTCGCCTCCGAGACCATCGACAGCTCCTCAGGAGTCGTCTTCCCCTACTACGATCACGATACCAACATGGTCTATCTGGCTGGCAAAGGCGACGGCAACATCCGTTACTACGAAGTAGTGGACGAGGCGCCCTACGTGCATTTCCTCAACCAGTTCCTGTCAGGCAACCCTCAGCGTGGTCTGGGCTTCATGCCTAAGCGTGGCGTGAACACATCTATGTGCGAAGTGTTCCGTTTCTACAAGCTGCACACATCCCGTGGCCTCTGCGAGCCCATATCTATGATCGTACCGCGCAAGTCCGACTGCTTCCAGGAGGATTTGTACCCTGACACGGCCGCTCCTCAGCCGGCCCTCTCAGCACGCGACTGGCTCAGCGGAGTAAATGCACCGCCACTACTCATTAGTATGAAAACAGGGGTGACGATATCCACGCACAAGCCCCGGAACAACAAGGACGCGCCCGCGCTGCAGCCGCAGGACGCCAACAACAGAAAGAAGTTCGCCTTTCTGTCGCGTGAGACGACCCCCGACTACCGCCCTCTAGCGACGTGGCAGGGCAACCAGGACGATACGCAGGTTCAAGTGACGGAGAAGTGTCAGAAGCAGAACACCAACCAGAACACGAAGTTCCACCAGCTCCAGAGGATGTTCGGCAAACAGGCCGGTGACGTGGAAGTAGTGCCGCTCTACAAACAGATCAACCAGGGAGACGTTTTCAACACGGAGCACGAGACGGGGCGTCTGGATTTCAACGCCAGCCGCGTCACCGGCCACAAGGGTCCAGTGTTGGATATCAAGTGGAACCCGTTCAACGACAATGTCATAGCCTCCTGCTCTGACGACTGCACGGTGAAGATATGGCACATCCCTGATGGTGGCCTATCTATGCACCTCACCGACTGGCTGGTTGAGCTCCACGGGCACAAGAGGCGTGTGGCCTACATAGAGTGGCATCCCACGGCTGAGAACATACTGTTTAGTGCTGGATTCGATTATCTGATCTTTGTATGGGATGTGGGCAAGGGCGAGGCTGTTAAGGTCATCGATTGCCACAGTGACGTCATCTATTGCATGTCCTTCAATCGTGACGGATCACTGTTAGCGACCACCTGCAAGGACAAGAAACTACGGGTTATAGAGCCTAGGCGAGGGATCGTGCTGTCTGAGGGGCCCTGTCACCTCGGCACCAAGGCTTCCAAGTGCACGTTCCTTGGCGCTCAGTGCAAAGT
ATTGACAACTGGTTTCTCGCGACACAGCGACCGTCAGTACGCCGTTTGGGACCAACACGACGTGAGCGAGCCCCTCGCCTCCGAGACCATCGACAGCTCCTCAGGAGTCGTCTTCCCCTACTACGATCACGATACCAACATGGTCTATCTGGCTGGCAAAGGCGACGGCAACATCCGTTACTACGAAGTAGTGGACGAGGCGCCCTACGTGCATTTCCTCAACCAGTTCCTGTCAGGCAACCCTCAGCGTGGTCTGGGCTTCATGCCTAAGCGTGGCGTGAACACATCTATGTGCGAAGTGTTCCGTTTCTACAAGCTGCACACATCCCGTGGCCTCTGCGAGCCCATATCTATGATCGTACCGCGCAAGTCCGACTGCTTCCAGGAGGATTTGTACCCTGACACGGCCGCTCCTCAGCCGGCCCTCTCAGCACGCGACTGGCTCAGCGGAGTAAATGCACCGCCACTACTCATTAGTATGAAAACAGGGGTGACGATATCCACACACAAGCCCCGGAACAACAAGGACGCGCCCGCGCTGCAGCCGCAGGACGCCAACAACAGGAAGAAGTTCGCCTTTCTGTCGCGTGAGACGACCCCCGACTACCGCCCTCTAGCGACGTGGCAGGGCAACCAGGACGATACGCAGCAGGTTCAAGTGACGGAGAAGTGTCAGAAGCAGAACACCAACCAGAACACGAAGTTCCACCAGCTCCAGAGGATGTTCGGCAAACAGGCCGGTGACGTGGAAGTAGTGCCGCTCTACAAACAGATCAACCAGGGAGACGTCTTCAACACGGAGCACGAGCTGCGACTCGCATTCAACCGTCAGGGAGAAGAACTGAGAATAGTAAAACGTCAACTACAGAACAGCCAACAGAGAGTGAGAGAGCTGGAACAACACATCGCCTCGCTACAGTCACGACTCAACACAGCCTAA

Protein sequence:

>DPOGS201048-PA
MPPVKDDLIFNCVEVNIPEKSPGKGTALVDVRACNKPAAKVWFRGVRSSKFRHVYGVPFKRERCYDNIKITRNAHDSNFCAVNPKFVAIVTEVAGGGAFLVLPLDHVKIWHIPDGGLSMHLTDWLVELHGHKRRVAYIEWHPTAENILFSAGFDYLIFVWDVGKGEAVKVIDCHSDVIYCMSFNRDGSLLATTCKDKKLRVIEPRRGIVLSEGPCHLGTKASKCTFLGAQCKVLTTGFSRHSDRQYAVWDQHDVSEPLASETIDSSSGVVFPYYDHDTNMVYLAGKGDGNIRYYEVVDEAPYVHFLNQFLSGNPQRGLGFMPKRGVNTSMCEVFRFYKLHTSRGLCEPISMIVPRKSDCFQEDLYPDTAAPQPALSARDWLSGVNAPPLLISMKTGVTISTHKPRNNKDAPALQPQDANNRKKFAFLSRETTPDYRPLATWQGNQDDTQVQVTEKCQKQNTNQNTKFHQLQRMFGKQAGDVEVVPLYKQINQGDVFNTEHETGRLDFNASRVTGHKGPVLDIKWNPFNDNVIASCSDDCTVKIWHIPDGGLSMHLTDWLVELHGHKRRVAYIEWHPTAENILFSAGFDYLIFVWDVGKGEAVKVIDCHSDVIYCMSFNRDGSLLATTCKDKKLRVIEPRRGIVLSEGPCHLGTKASKCTFLGAQCKVLTTGFSRHSDRQYAVWDQHDVSEPLASETIDSSSGVVFPYYDHDTNMVYLAGKGDGNIRYYEVVDEAPYVHFLNQFLSGNPQRGLGFMPKRGVNTSMCEVFRFYKLHTSRGLCEPISMIVPRKSDCFQEDLYPDTAAPQPALSARDWLSGVNAPPLLISMKTGVTISTHKPRNNKDAPALQPQDANNRKKFAFLSRETTPDYRPLATWQGNQDDTQQVQVTEKCQKQNTNQNTKFHQLQRMFGKQAGDVEVVPLYKQINQGDVFNTEHELRLAFNRQGEELRIVKRQLQNSQQRVRELEQHIASLQSRLNTA-
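
Length check (sketch):

The record lists a 2940 bp transcript and a 979 aa protein, which agree if the transcript is coding sequence only: 3 x (979 + 1 stop codon) = 2940. The minimal Python sketch below shows one way to run the same check on the two FASTA entries above. It assumes the transcript contains no UTRs and that a trailing "-" or "*" in the protein marks the stop codon. The function names `read_fasta` and `cds_matches_protein` are hypothetical helpers written for illustration, not part of any database tooling.

```python
def read_fasta(text):
    """Parse FASTA text into {header: sequence}, joining wrapped lines."""
    records = {}
    header = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            header = line[1:]
            records[header] = ""
        elif header is not None:
            records[header] += line
    return records


def cds_matches_protein(cds, protein):
    """Return True if len(cds) == 3 * (residues + 1 stop codon).

    Assumption: the nucleotide sequence is CDS only (no UTRs) and the
    protein may end with a '-' or '*' stop marker, which is not counted.
    """
    residues = protein.rstrip("-*")
    return len(cds) == 3 * (len(residues) + 1)
```

To apply it here, paste both entries into one string, call `read_fasta`, and pass the `DPOGS201048-TA` and `DPOGS201048-PA` sequences to `cds_matches_protein`. A `False` result would point to a wrapped-line or copy error rather than to a specific base.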