Monarch geneset OGS2.0

DPOGS203689
TranscriptDPOGS203689-TA1458 bp
ProteinDPOGS203689-PA485 aa
Genomic positionDPSCF300010 - 2030560-2032832
RNAseq coverage420x (Rank: top 29%)
Annotation
HeliconiusHMEL0133150.079.47% 
BombyxBGIBMGA003483-TA0.087.79% 
Drosophilaari-2-PA0.061.75% 
EBI UniRef50UniRef50_B0XBC40.062.16%Zinc finger protein n=8 Tax=Bilateria RepID=B0XBC4_CULQU
NCBI RefSeqXP_624643.20.066.32%PREDICTED: similar to ariadne 2 CG5709-PA [Apis mellifera]
NCBI nr blastpgi|3071881420.067.90%Protein ariadne-2 [Camponotus floridanus]
NCBI nr blastxgi|3071881420.067.90%Protein ariadne-2 [Camponotus floridanus]
Group
Gene OntologyGO:00082702.1e-15zinc ion binding
KEGG pathway 
InterPro domain[189-252] IPR0028672.1e-15Zinc finger, C6HC-type
Orthology groupMCL12642 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203689-TA
ATGTCTGCGGAATCTGATATGGAATATTCGGACAATGATGGAGACGACTATGACTATTACGGAGGACAAGATGATTGCGATATGGAAGCTGTGGACCGTAGCAAATCAGATCCAGAATATTTTCAATACACTTGTCTTAGAGTAGAAGAAGTGGAAAAATTATTGAATGAATCGATTGAACTACTTAGTAACAGTCTCCAAATAACACCTTCACTAGCGAAGGTAATGTTGCATGCTTATGAATGGAATGCTCAAGAGATTATAAAAAAATATAACGAAAATCCGAATGAAGTTCTTGTATACAGTTGTGTGAAACCTCGTCTACCAGTTGTTCAGGGTTGCACAAGTCGATCTATCTGTGCTGTTTGTGCAACCACACCCCCCATCAATAATTACAGTGCACTTGCTTGTGGGCACTTTTTTTGCAATGAATGTTGGGCAATGCATTTTGAAGTTCAAATCATGCAAGGAGTCTCCAACACTATACAATGTATGGCTCAAGATTGTGAAGTAAGAGTACCAGAGGACTTTGTTTTATCTCATGTAACAAAACCAGCTTTAAGAGAACGCTATCAACAATTTATGTTCAAAGATCATGTTAAATCCCATCCTCAGCTCCGATTTTGTCCAGGACCTAATTGTCAGTGGATATATCGTGCCTGGGTCCGGGAAGGTGCTCGTCGTGTTGAATGTCAAGGCTGTGAAATGCTAACTTGTTTCTCATGTGGGGCTCCACATCATGCTCCCACTGACTGTATTACTATAAGGCGTTGGCTGACAAAGTGTGCCGATGATTCTGAAACTGCCAACTATATTAGTGCTCATACAAAGGACTGTCCTAAATGTCAAATCTGTATAGAAAAAAATGGTGGTTGTAATCATATGCAATGTGGTGCTTGCCGACATGATTTTTGTTGGGTATGTCTTGGCGACTGGGGTTATCATGGATCAGAGTACTATGAGTGCAGTCGCTATAAGGAAGACCCTAATTCTGTAACAGATAGTCAACAAGCCCAAGCAAAGGAAGCTTTAAAAAAATATTTGCATTACTATGAAAGATGGGAAAATCATGCAAGGTCTTTAAAACTAGAGGAGCAGACACTTGCAACTTTGAAAAGCCGAATAAATCAAAAGGTAATGGCTGGGGAGGGCACATGGATTGATTGGCAGTACCTTTGGGATGCAGCTAGACTCTTGAAGCGGTGTAGATACACACTTCAATACACTTACCCATTTGCATACTATATGGATATAGGGCCCAGAAAAGAATTATTTGAGTATCAGCAGGCACAACTGGAAGCAGAAATAGAGAATTTATCTTGGAAAATAGAAAGGGCTGAAACCACGGACAGGGGAGATCTTGAAAATCAAATGGATATTGCTGAAAAGCGAAGAACAACATTACTTAAAGACTTCTTAGAATTTAATATTAGTATTGCATCTGGTAGTAAAATATAA

Protein sequence:

>DPOGS203689-PA
MSAESDMEYSDNDGDDYDYYGGQDDCDMEAVDRSKSDPEYFQYTCLRVEEVEKLLNESIELLSNSLQITPSLAKVMLHAYEWNAQEIIKKYNENPNEVLVYSCVKPRLPVVQGCTSRSICAVCATTPPINNYSALACGHFFCNECWAMHFEVQIMQGVSNTIQCMAQDCEVRVPEDFVLSHVTKPALRERYQQFMFKDHVKSHPQLRFCPGPNCQWIYRAWVREGARRVECQGCEMLTCFSCGAPHHAPTDCITIRRWLTKCADDSETANYISAHTKDCPKCQICIEKNGGCNHMQCGACRHDFCWVCLGDWGYHGSEYYECSRYKEDPNSVTDSQQAQAKEALKKYLHYYERWENHARSLKLEEQTLATLKSRINQKVMAGEGTWIDWQYLWDAARLLKRCRYTLQYTYPFAYYMDIGPRKELFEYQQAQLEAEIENLSWKIERAETTDRGDLENQMDIAEKRRTTLLKDFLEFNISIASGSKI-