Monarch geneset OGS2.0

DPOGS210101
Transcript: DPOGS210101-TA; 573 bp
Protein: DPOGS210101-PA; 190 aa
Genomic position: DPSCF300017 + 764090-768326
RNAseq coverage: 6201x (Rank: top 2%)
Annotation
Heliconius: HMEL002978; 2e-92; 85.71%
Bombyx: BGIBMGA012680-TA; 7e-95; 96.49%
Drosophila: pk-PC; 4e-25; 41.58%
EBI UniRef50: UniRef50_E3WVE1; 7e-69; 74.03%; Putative uncharacterized protein n=1 Tax=Anopheles darlingi RepID=E3WVE1_ANODA
NCBI RefSeq: XP_001945130.1; 1e-73; 75.71%; PREDICTED: similar to four and a half lim domains [Acyrthosiphon pisum]
NCBI nr blastp: gi|322793841; 5e-71; 76.57%; hypothetical protein SINV_04723 [Solenopsis invicta]
NCBI nr blastx: gi|332021159; 3e-82; 81.71%; Four and a half LIM domains protein 2 [Acromyrmex echinatior]
Group
Gene Ontology: GO:0008270; 4.1e-15; zinc ion binding
KEGG pathway: dre:368250; 4e-24
 K04511 (PRICKLE) maps-> Wnt signaling pathway
InterPro domain: [104-168] IPR001781; 4.1e-15; Zinc finger, LIM-type
Orthology group: MCL11109; Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210101-TA
ATGGCTGACGTTGACGTACAAGTTCAGTATACCACCACTGAGAGAAAGACCAGGAAGGTCAAAAAGACATCCAAGCGTAGGGAGTCTAAGGATGGAGAGGTTTCCGTCACCGAACTCGAACAGACCAATACCACCAACGACGATGGCGGTGAATACACCAAGGCGATGAACAAGGACTGGCACAGCGGTCACTTCTGCTGCTGGAAGTGCGATGAGTCCCTCACCGGACAACGCTACGTCCTGAGAGATGAACAGCCCTATTGCATCAAGTGCTACGAGGGTGTCTTCGCCAACGGATGCGAGGAATGCAACAAGATCATCGGCATCGACTCCAAGGATCTGTCGTACAAAGACAAGCACTGGCACGAGGCCTGTTTCCTCTGCGCTAAGTGTCGCGTCTCTCTGGTGGATAAACAGTTCGGCTCCAAGTTAGACAAGATCTACTGCGGCAACTGTTACGACGCCCAGTTCGCCAGCCGCTGCGATGGATGCGGAGAGGTCTTCCGAGCTGTTTATTCAGTCATCTGTCCGTACATAGTTGGTGCTGTACCCTTGCTGGTTAAGGTGTTGTAA

Protein sequence:

>DPOGS210101-PA
MADVDVQVQYTTTERKTRKVKKTSKRRESKDGEVSVTELEQTNTTNDDGGEYTKAMNKDWHSGHFCCWKCDESLTGQRYVLRDEQPYCIKCYEGVFANGCEECNKIIGIDSKDLSYKDKHWHEACFLCAKCRVSLVDKQFGSKLDKIYCGNCYDAQFASRCDGCGEVFRAVYSVICPYIVGAVPLLVKVL-
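
The transcript and protein records above can be cross-checked against each other: 573 bp is 191 codons, i.e. 190 amino acids plus one stop codon. The following is a minimal sketch of such a check, assuming the standard genetic code (NCBI translation table 1) and assuming the trailing "-" in the protein record marks the stop codon. The function name `translate` and its `stop_symbol` parameter are illustrative and are not part of the database.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1), with codons
# enumerated in T, C, A, G order for the first, second and third base.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence in frame 1, one codon at a time.

    Stop codons are written as `stop_symbol` (assumed here to match the
    "-" that ends the protein record above).
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First six and last three codons of DPOGS210101-TA, copied from the record.
print(translate("ATGGCTGACGTTGACGTA"))
print(translate("GTGTTGTAA"))
```

Passing the full DPOGS210101-TA sequence to `translate` gives a string that can be compared directly with the DPOGS210101-PA record.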