Monarch geneset OGS2.0

DPOGS200304
Transcript: DPOGS200304-TA; 1695 bp
Protein: DPOGS200304-PA; 564 aa
Genomic position: DPSCF300026; 249862-256607
RNAseq coverage: 945x (Rank: top 14%)
Annotation
Heliconius: HMEL002913; 9e-105; 74.38%
Bombyx: BGIBMGA005626-TA; 4e-150; 66.29%
Drosophila: CG7368-PB; 2e-51; 61.19%
EBI UniRef50: UniRef50_D2A549; 2e-93; 44.76%; Putative uncharacterized protein GLEAN_15472 n=2 Tax=Tribolium castaneum RepID=D2A549_TRICA
NCBI RefSeq: XP_970666.1; 2e-95; 45.35%; PREDICTED: similar to AGAP001269-PA [Tribolium castaneum]
NCBI nr blastp: gi|91084469; 5e-94; 45.35%; PREDICTED: similar to AGAP001269-PA [Tribolium castaneum]
NCBI nr blastx: gi|91084469; 4e-105; 46.11%; PREDICTED: similar to AGAP001269-PA [Tribolium castaneum]
Group
Gene Ontology: GO:0003676; 2.8e-10; nucleic acid binding
KEGG pathway:
InterPro domain: [404-435] IPR013087; 2.8e-10; Zinc finger, C2H2-type/integrase, DNA-binding
Orthology group: MCL18306; Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200304-TA
ATGTGCGATATGCAGAAAATGCCAACAAGAGACGATGCGTCTTGCGACGAGGACTCAGATATGAATTTCACTACATTCTCGGCTGTGGGTGCAACGGGAACACATTTTGTCACAGCGGGTGGACAGTTGCCGGTCACTAAATTACAACAGAATGTCTCCAATAACGGCACACCCCTGTCTCAGGTGGTGGGAATGGTGAGCGGAGTGAATATGAGTGAAGGTGTGCAGTATGTACGAGCGATTGATTCATCGTCTCTCCAGGCGGGGCCTCAACTTATATCAGTGCCAATTGCCTTACCTGGAACAAAACCTGGTGACCCCCAGCCCACAGTGCAGATTCAGGTCCTCAGTCCAAATCTAACGCTGCAACAGCAACAACCAAAATATCAGATGCAAATACCAATTCAAGGATTTCAACAAGGTGGAGCGGTACTCACAGTGGCGTACTCGCCGGATGGGAACGAATCTGGTGGGATACAACTCATAGGAAACACATTACCAGAAGGTCTACAAGTGCTGGCGTTGCCTCAGGAGATGCAATTGATACAACAGGAAAATAAGGAACAAGTACAGCAAAACATACAACATCAGGTGTTCATAACGCCCAACAACCAGATCGTGATCAACGGCCCCGACAAGTCCGTGAACAACAACAACGACGGTGACGTCACCAACATCGTCATCAAGGAGGAGTGTAATGACGACAACAACGATGACAGTTCCCAAGGCACGGACACAGATGGCGTCCCCTGGCATATCGGTCAACCGTCGCAGGCTCTCGTTAAATACCTCAACACTCTCGCGCCACAGCAAACGCAAGCGTTGCCGGTGTCCTTACAGCAGTTCCTGAGACTGAACCCCACCGAGACTAAGAAGGTTGAGGCTGAGGACATTGACATGACCCCGGAGGAAGATAAGAAAGAGGTGATCACTGAAGCCGTCTTAGAAGAAGATGGTACTTTACGAGTTCAGACAAAGAAGAAGAAGAAATACAAGAAGAAAGCAGCTAAGCCGGCGCGGCCGAAGCCGGGGCAGGTGATTATAGCGACCGCTGCCGACGGTACGCCCGTGTACTGCTGCCCGCAGTGTGACATGGCCTACCCCGAGAAAGATCAGTTGGAGATGCATCTCTCCGTACACAAGATCGAGAGACGATTCATATGCGGAATATGCGGGGCTGGGCTCAAACGTAAAGAGCACCTTGAGAGACACAAGCTGGGTCATAACCCAGAGCGGCCGTACGTGTGCGGCGCCTGCGGGAAGGGCTTCAAGAGACGGGAACATCTCAACCTGCACGCTGTCATACACTCGGGCGTCAAGACGGAGATGTGCGGGGAGTGTGGGAAAGGATTCTATCGCAAGGATCATCTCCGTAAGCACACACGTTCACACGAGAGCAAGAGAGCGAGGGACGAGGCGAACAACGACTGTATGGAGACCAAGACTGGCAACACCAACGCTAACGTTAACATCACAAACACCAACACCATCATGCCGGAGATTACGATACACGTGCCGACAAGTTCTAATATGCAGGTCCCCGTTCAGATCAACATCCCTCAGCACGTGATGTCGTCTCTGGTGGGACAGACGCACACACACACGCACACCAACACACACACGCACATGCACGCCCACGACGAGGCGGGGGATGCGCACGCGCAGCTCGACGCGCTCCTCGCGCAGCACACGTGA

Protein sequence:

>DPOGS200304-PA
MCDMQKMPTRDDASCDEDSDMNFTTFSAVGATGTHFVTAGGQLPVTKLQQNVSNNGTPLSQVVGMVSGVNMSEGVQYVRAIDSSSLQAGPQLISVPIALPGTKPGDPQPTVQIQVLSPNLTLQQQQPKYQMQIPIQGFQQGGAVLTVAYSPDGNESGGIQLIGNTLPEGLQVLALPQEMQLIQQENKEQVQQNIQHQVFITPNNQIVINGPDKSVNNNNDGDVTNIVIKEECNDDNNDDSSQGTDTDGVPWHIGQPSQALVKYLNTLAPQQTQALPVSLQQFLRLNPTETKKVEAEDIDMTPEEDKKEVITEAVLEEDGTLRVQTKKKKKYKKKAAKPARPKPGQVIIATAADGTPVYCCPQCDMAYPEKDQLEMHLSVHKIERRFICGICGAGLKRKEHLERHKLGHNPERPYVCGACGKGFKRREHLNLHAVIHSGVKTEMCGECGKGFYRKDHLRKHTRSHESKRARDEANNDCMETKTGNTNANVNITNTNTIMPEITIHVPTSSNMQVPVQINIPQHVMSSLVGQTHTHTHTNTHTHMHAHDEAGDAHAQLDALLAQHT-
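
Consistency check (illustrative):

The transcript and protein lengths in this record can be cross-checked with simple arithmetic: 1695 bp is 565 codons, which matches 564 amino acids plus one stop codon. The sketch below is not part of the database record. It assumes the transcript is coding sequence only, with the stop codon included, and that the trailing "-" in the protein sequence marks that stop. The function names are hypothetical and chosen only for this example.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # standard genetic code stop codons


def expected_protein_length(transcript_bp: int, has_stop_codon: bool = True) -> int:
    """Return the amino-acid count implied by a CDS-only transcript length."""
    if transcript_bp % 3 != 0:
        raise ValueError("transcript length is not a multiple of 3")
    codons = transcript_bp // 3
    # The stop codon does not encode an amino acid, so it is subtracted.
    return codons - 1 if has_stop_codon else codons


def looks_like_complete_cds(seq: str) -> bool:
    """Check for ATG start, a terminal stop codon, and a length divisible by 3."""
    seq = seq.upper()
    return (
        len(seq) % 3 == 0
        and seq.startswith("ATG")
        and seq[-3:] in STOP_CODONS
    )


print(expected_protein_length(1695))  # 564
```

To apply the second check to this record, pass the DPOGS200304-TA sequence above as a single string to looks_like_complete_cds; the sequence begins with ATG and ends with TGA.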