Monarch geneset OGS2.0

DPOGS210910
TranscriptDPOGS210910-TA2295 bp
ProteinDPOGS210910-PA764 aa
Genomic positionDPSCF300045 - 76267-103682
RNAseq coverage153x (Rank: top 53%)
Annotation
HeliconiusHMEL0158153e-13880.80% 
BombyxBGIBMGA003093-TA3e-12870.79% 
Drosophilaspir-PA1e-7039.09% 
EBI UniRef50UniRef50_B4KLF85e-11136.31%GI22759 n=8 Tax=Coelomata RepID=B4KLF8_DROMO
NCBI RefSeqXP_002090799.11e-12638.53%GE13304 [Drosophila yakuba]
NCBI nr blastpgi|1954847002e-12538.53%GE13304 [Drosophila yakuba]
NCBI nr blastxgi|2236346995e-12739.36%RecName: Full=Protein spire
Group
KEGG pathwaydya:Dyak_GE133043e-126 
 K02098 (SPIR)maps-> Dorso-ventral axis formation
InterPro domain[527-605] IPR0110114.9e-11Zinc finger, FYVE/PHD-type
Orthology groupMCL10622 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210910-TA
ATGAGTGTGTTATTCCTGCAGCGTTGTATATGGCACTGCGGCGGTGGAAGTGGTCGTTTGGCTCGTGAGACGGCCGCTGGTCACTACCGCGCCGTCTGCCGGGCGCTGGTCGCCGAAGCCCTCGAGCTGGCCTCGTTCCTGGCCCGTGTTCGTGTCACAGGAGCCAGGGACCTCGGAGCCGCTGAGGACTCCGCCACTCATCTCGACACTCTGCAGTTCTCAGATTGGGCGCGGTTCTGGATGCAAGTTATCGGAGAGTTGCGAATGGGAGTGAAACTGAAGAAGGTTAACTACTCACGGACGCCAATAGAATATGAATTGACTCCATATGAAATCCTCATGGACGATATAAGGTCGAGGCGGTATACGTTAAGGAAAGTTGATGGCGCCATACCTCAGAATGTTAAGAAGGACGCACACGCAATGATACTCGAATTTATTAGAAGTAGACCGCCGCTCAAGAAGGCATCAGAACGCAAGTTGGCCCCGGTCCGTCGCGAGGTGACTCCAAGGGAACAGCTGCTAGCTTCCATACAACTCGGGAAACAGTTGCGACCTACGACGCAATCCAGGAGTATATTCAAGCTGCGGCGGTCGCGGGTTTCTCCGGCAGCTTTGCTGCTGGGGTCACGACCTGTCTCGGGGCTCGTAGAACCGATCCGGAGTCCCGAGGAGCCCGCGCTGGATGTGGCCCAGGACAAGTCTTATTGGGGCAGCGTCGCAACCGCTACGTCAACTACTAGTGGTTCAGGCAGCGGTGCCGGTGGTGGCGAGTCTCGTAGAGGAGATGTGACGCCACACGCACCGCCACAGAGACGACTCATTAAAGTTGATTTTGACAATCTCGAGGATGACGAAGACGAAGACGACACAGAGACTTGTGTCAGTCCTGAAACCTGCCCGCCCCAGCCCTTCCCGAGGCACACCATCAAACAGACACCGCCGAAACCATGGAAGAGAACCGGTTGGTATCACCACCTACGACCTGGCTACCCAATGTCCCTCCCGAAGAGCGTCCATGCGTCGCCACACAATAACACACGCACATGCGTGCCCCGCGACCGACGGCGCACAGTCACTGCCACATTCGAGGATGACGAAGACGAAGACGACACAGAGACTTGTGTCAGTCCTGAAACCTGCCCGCCCCAGCCCTTCCCGAGGCACACCATCAAACAGACACCGCCGAAACCATGGAAGAGAACCGTGTCCGTGGATACGCCGCGGGCTCCTCCCCGCGCGCGCATCTCACACGACGAGTACCATCAGTTTTTCGACGAGACCCTCGAATCCTACGACCTGGCTACCCAATGTCCCTCCCGTCGAGCGTCCATGCGTCGCCACACAATAACACACGCACACGCGTGCCCCGCGACCGACGGCGCACAGTCACTGCCACATTCGAGACCGGGTTCTCGTGCGTCCTGCGCAGGGGCGTCGCTCAGCAGTGACGACGCTCAGCTGAATGAGTTGTCGTGGTCCCGAGCCTCGCTCCAGGACGAGCTGATCAAGTCGGTTAGTGACAGCCCTACCAGCACGCATAAGCAATGGCAGGACGCGATTATATCAGACGAGCGTCTCTCGCTAACGCTAGAGGAGATCGTCCACATCAGATCTGTGCTGACGAAGGCTGAGCTAGAGGTTCTGCCGGTGGAGGGCAGGGTCAAAGAAGACGTGGAGAGACGACGGGTTTGTTTTCTGTGCCTGAAGACGAGGTTCGGCATCTTCGGTCCTTGGGGTCAGAAATGCAAACTGTGCAAGAAGACCGTCTGTCAGAAATGTTGTTCCAAGATGCGTATCCCGACGGAACACTTCGCGCACGTCCCCGTTGCGTTGTTGTCGCCGTCCTTGCTCCCCTCCCCCGACGAGGAGACCGCCTTCCCCCGCTCGCTGATGGCGCGGCTCGTGTTGCCCGAACACGCGGCGTCGGTAGAGAACAGTGTAGGCTCGGCCCCCAGCAGCCCGGGCTCGCGCCGCATCACGTCAGCCCCGGGCTCCCGCGGCGCCTCCGCCCTCGGTTTCTCGGACTCCTCCGGCGGCCCCGGCAGCATGCCCTGTGTCTCAACACCGCTATCCACATTCTCCACATTCGATCGCAGAGCGAGATACGGTCGCAGTGCGGGTTCGGGCGCGGCGGAGAGACTGCGAGGCGTGCAGATGGCAGTCTGTCACGACTGTAAGGCGATGGTGCTTCAGATCATAAAATCCTCGCGCGCCGCCCGCTCCGCTTCCCGCGACCGCGCGCTGCGACACCTCACCCTGGACCTGGCGCCCGTATACACCGCGGACTGCTAG

Protein sequence:

>DPOGS210910-PA
MSVLFLQRCIWHCGGGSGRLARETAAGHYRAVCRALVAEALELASFLARVRVTGARDLGAAEDSATHLDTLQFSDWARFWMQVIGELRMGVKLKKVNYSRTPIEYELTPYEILMDDIRSRRYTLRKVDGAIPQNVKKDAHAMILEFIRSRPPLKKASERKLAPVRREVTPREQLLASIQLGKQLRPTTQSRSIFKLRRSRVSPAALLLGSRPVSGLVEPIRSPEEPALDVAQDKSYWGSVATATSTTSGSGSGAGGGESRRGDVTPHAPPQRRLIKVDFDNLEDDEDEDDTETCVSPETCPPQPFPRHTIKQTPPKPWKRTGWYHHLRPGYPMSLPKSVHASPHNNTRTCVPRDRRRTVTATFEDDEDEDDTETCVSPETCPPQPFPRHTIKQTPPKPWKRTVSVDTPRAPPRARISHDEYHQFFDETLESYDLATQCPSRRASMRRHTITHAHACPATDGAQSLPHSRPGSRASCAGASLSSDDAQLNELSWSRASLQDELIKSVSDSPTSTHKQWQDAIISDERLSLTLEEIVHIRSVLTKAELEVLPVEGRVKEDVERRRVCFLCLKTRFGIFGPWGQKCKLCKKTVCQKCCSKMRIPTEHFAHVPVALLSPSLLPSPDEETAFPRSLMARLVLPEHAASVENSVGSAPSSPGSRRITSAPGSRGASALGFSDSSGGPGSMPCVSTPLSTFSTFDRRARYGRSAGSGAAERLRGVQMAVCHDCKAMVLQIIKSSRAARSASRDRALRHLTLDLAPVYTADC-