Monarch geneset OGS2.0

DPOGS213215
TranscriptDPOGS213215-TA1026 bp
ProteinDPOGS213215-PA341 aa
Genomic positionDPSCF300114 + 400463-402065
RNAseq coverage1341x (Rank: top 9%)
Annotation
HeliconiusHMEL0170885e-17496.77% 
BombyxBGIBMGA007410-TA3e-15793.31% 
Drosophilalark-PB1e-10965.32% 
EBI UniRef50UniRef50_F4W4I14e-11068.87%RNA-binding protein lark n=6 Tax=Formicidae RepID=F4W4I1_ACREC
NCBI RefSeqNP_001037293.11e-17293.88%RNA-binding protein lark [Bombyx mori]
NCBI nr blastpgi|1129838342e-17193.88%RNA-binding protein lark [Bombyx mori]
NCBI nr blastxgi|1129838340.093.88%RNA-binding protein lark [Bombyx mori]
Group
Gene OntologyGO:00036761.6e-20nucleic acid binding
GO:00001663.6e-20nucleotide binding
GO:00082706.7e-06zinc ion binding
KEGG pathway 
InterPro domain[8-73] IPR0005041.6e-20RNA recognition motif domain
[84-169] IPR0126773.6e-20Nucleotide-binding, alpha-beta plait
[169-185] IPR0018786.7e-06Zinc finger, CCHC-type
Orthology groupMCL11996 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213215-TA
ATGCCGGGCGCCGGTACTTTCAAAATCTTCGTCGGGAATCTTTCCGATAAAACCACAGATGCCGATCTCAGACCGCTGTTCGAAAAGTTCGGTACGGTCGTAGAATGCGATATCGTCAGAAATTACGGTTTCGTGCACATGGAAAACGAGCAAGTCGGCCGCGAAGCCATTCAGAACCTCAACGGTGAAGTAGTTCACGGGCAGGCGATCAAAATCGAGGCGGCCAAGAGTAGAAAGGCGCCGTCGACGCCGACCACTAAAATATTCGTGGGTAACCTAACGGACAAGACGCGCGCGCCCGAAGTCCGCGAGCTGTTTCAGAAGTTCGGCACGGTAGTGGAGTGCGATATCGTTCGTAACTATGGCTTCGTGCACCTGGACGCGTCGGGCGACGTGAACGAGGCGATCAAAGAGCTGAACGGCATGATGGTGGACGGGCAGCCCATGAAGGTGCAGCTCTCCACGAGCCGGGTGCGCCAGCGGCCTGGCATGGGCGACCCCGAGCAATGCTACCGCTGCGGCCGCGGTGGTCATTGGTCCAAGGAGTGCCCCAAGGCCATGGGCCCCGACCGCAACGGTTTCCGCGATCGAGCGTTCGGCCGCGACCCCTACCCCCCGCCGCCGCCGCCGCCCTTCCTCCGCGATCGCATGATGGGAGGATTTGGGGATCCTTACGATGGTTACTATGACCGAGCTCGGTTTGACTCTGCGCGGGACCTGTTCGAGCGGAGATACCCCGTGGGAGCTTCGAGGGGTCTGGACATGGCCCCCTCTCGCGCCCGAGGTGACTTCGTGTCCCCGCCCCTGCGCCGTGAGCCCATGCCGCCCATGCCTTCCCTGCCGCCCATGAGAAGCAGTATGGGTTCCATGAGGTCCTCGTATGATGCCATGTACAGTCGCAGGAGTCCACCCAGAGGACCACAGATGTCTAGAGGGATGTATGAAGATTTCAGCCGGGACACATTTGATGACAGAAGGCCGGGGATGCGAGGCCCCTCGCCTTCCAGAAGATACGCGCCCTACTAG

Protein sequence:

>DPOGS213215-PA
MPGAGTFKIFVGNLSDKTTDADLRPLFEKFGTVVECDIVRNYGFVHMENEQVGREAIQNLNGEVVHGQAIKIEAAKSRKAPSTPTTKIFVGNLTDKTRAPEVRELFQKFGTVVECDIVRNYGFVHLDASGDVNEAIKELNGMMVDGQPMKVQLSTSRVRQRPGMGDPEQCYRCGRGGHWSKECPKAMGPDRNGFRDRAFGRDPYPPPPPPPFLRDRMMGGFGDPYDGYYDRARFDSARDLFERRYPVGASRGLDMAPSRARGDFVSPPLRREPMPPMPSLPPMRSSMGSMRSSYDAMYSRRSPPRGPQMSRGMYEDFSRDTFDDRRPGMRGPSPSRRYAPY-