Monarch geneset OGS2.0

DPOGS214241
Transcript: DPOGS214241-TA, 1461 bp
Protein: DPOGS214241-PA, 486 aa
Genomic position: DPSCF300014 + 1170244-1175047
RNAseq coverage: 549x (Rank: top 23%)
Annotation
Heliconius: (none listed)
Bombyx: BGIBMGA005962-TA, 2e-112, 85.11%
Drosophila: mbl-PH, 2e-76, 74.18%
EBI UniRef50: UniRef50_D6WKH9, 8e-99, 69.85%, Putative uncharacterized protein n=3 Tax=Endopterygota RepID=D6WKH9_TRICA
NCBI RefSeq: XP_001812946.1, 4e-98, 65.08%, PREDICTED: similar to muscleblind CG33197-PA [Tribolium castaneum]
NCBI nr blastp: gi|270007552, 3e-98, 69.85%, hypothetical protein TcasGA2_TC014149 [Tribolium castaneum]
NCBI nr blastx: gi|270007552, 9e-100, 70.11%, hypothetical protein TcasGA2_TC014149 [Tribolium castaneum]
Group
Gene Ontology: GO:0008270, 1.1e-05, zinc ion binding
Gene Ontology: GO:0003676, 1.1e-05, nucleic acid binding
KEGG pathway: (none listed)
Orthology group: MCL15809, Multiple-copy universal gene
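
The genomic position field in this record packs the scaffold, strand, and coordinates into a single string. The following is a minimal Python sketch for splitting it, assuming the layout "scaffold strand start-end" and 1-based inclusive coordinates; neither assumption is stated in the record, and the names GenomicPosition and parse_position are hypothetical.

```python
import re
from typing import NamedTuple


class GenomicPosition(NamedTuple):
    scaffold: str
    strand: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumes 1-based inclusive coordinates (not confirmed by the record).
        return self.end - self.start + 1


# Assumed layout: "<scaffold> <strand> <start>-<end>"
_POSITION_RE = re.compile(r"^(\S+)\s+([+-])\s+(\d+)-(\d+)$")


def parse_position(text: str) -> GenomicPosition:
    """Split a position string into scaffold, strand, start, and end."""
    match = _POSITION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized position string: {text!r}")
    scaffold, strand, start, end = match.groups()
    return GenomicPosition(scaffold, strand, int(start), int(end))


pos = parse_position("DPSCF300014 + 1170244-1175047")
print(pos.scaffold, pos.strand, pos.start, pos.end, pos.span)
```

Under the inclusive-coordinate assumption the locus spans 4804 bp on the scaffold, which is longer than the 1461 bp transcript.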
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214241-TA
ATGTGGTTGGGTTTAAACGAATTCTTATACTTCGCCATCATAATTGTCCAAACAATATTTTTGCTTCCAGAAAATGGACGGTGTAACCGAGAGAAACCACCCTGCAAGTACTTCCACCCCCCGCAGCATCTTAAGGATCAGCTGTTGATTAACGGTCGGAATCATCTAGCATTAAAGAACGCCCTGATGCAGCAGATGGGCCTTACGCCGGGCCAGGTGCTTCCTGGCCAGGTCCCAGCAGTTGCAACATCACCTTACTTATCGGGTGTGCCCGGAGTGGGCTCGACGTATGCTCAATACTATGCGCCGCAGCTGGTTCCGGCTGTGTTGGGTCACGACCCGTCCGCCGCAGCCGCCTCGCCACTAGGAGTCATGCAGCAGCCCGTATTGCAGCAGAAACTGCCGCGTACTGACCGTCTTGAGGTGTGCCGGGAGTTCCTGCGCGGCGCGTGCAAGCGGGCCGAGTCCGAGTGTCGCTTCGCGCACCCGCCGCCACCGGTGGTCGCTCACGACGACGGCTGTGTCACGGTGTGCATGGACGCCGTCAAGGGCCGCTGCGTCCGCGACCCCTGCCGCTATTTCCACCCTCCCCTGCACCTGCAGGCGCACCTTAAGGCGCAGGCGCGCGGCGCGATGGACATGAAAAGCGTCGGTTCCTTCTATTACGATAACTTCGCCTTTCCCGGTGTGGTCCCGTACAAAAGACAAGCTGCTGACAAAGCCGGAGTTCCCGTATACCAGCCGGCGACTACTTACCAGCAACTGATGCAGCTGCAGCAGCCATTCGTGCCCGTGTCATGTGAGTACCCCGCGCCCGCCTCGTCCGCCCCGCCGGCCGCGGTGACTGCGGTGTCGGGCGCCGCAGGACCCCAGACGGCGCCCGCGGCCGTCCGCCGCGCAGCCCGCTCCTCCCGCGCCCGCCTCGCCACCCGTCTCCGTTGCCGACGCGGCTGTACCGGACCCCGCCGCCGTCGCCAAAGAGGTCGCTCACAAGAATTACGCGGCCGCGCTCGCGCTCGCCGCACAGCACTCCGCCATGGCACACGCGGCCGCCGCCTACACACAACAGGCGTTCAAGGCCCGTGCCGCCATGCCGGGGTTGATGCGCGCACCGCTCATGATGCGGCCGGGTTGGCCCGCGCCGCCCGTGCCCATGCCCGCCTTCTATCAGCAGCCTTACATGTACGCGATGCCCCCGCCCGCGGCACCGTCCGCAGCCGCGGCGGGGGCGGCGGCGGCCGCTGCAGTCAACCCCTACAAAAAGATGAAGACGACCTAAGACGCGCGGGCGGGCCGCGGCCGGAGGGGCGCGGGGTGCGGGGTCTCCGCGTAGAGACTCGTACGTGTGCGATAGTGTGGCCCTGGCCTGTCGGGGCGTTAGCGCTAGGCGTGTTGTCACATTTGTATTGTATTATTGTAAGGACGAAGCGAGCGATTCCCGTCTCGCGGAAGTCTAGGTAG

Protein sequence:

>DPOGS214241-PA
MWLGLNEFLYFAIIIVQTIFLLPENGRCNREKPPCKYFHPPQHLKDQLLINGRNHLALKNALMQQMGLTPGQVLPGQVPAVATSPYLSGVPGVGSTYAQYYAPQLVPAVLGHDPSAAAASPLGVMQQPVLQQKLPRTDRLEVCREFLRGACKRAESECRFAHPPPPVVAHDDGCVTVCMDAVKGRCVRDPCRYFHPPLHLQAHLKAQARGAMDMKSVGSFYYDNFAFPGVVPYKRQAADKAGVPVYQPATTYQQLMQLQQPFVPVSCEYPAPASSAPPAAVTAVSGAAGPQTAPAAVRRAARSSRARLATRLRCRRGCTGPRRRRQRGRSQELRGRARARRTALRHGTRGRRLHTTGVQGPCRHAGVDARTAHDAAGLARAARAHARLLSAALHVRDAPARGTVRSRGGGGGGRCSQPLQKDEDDLRRAGGPRPEGRGVRGLRVETRTCAIVWPWPVGALALGVLSHLYCIIVRTKRAIPVSRKSR-
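
The transcript and protein lengths in this record are consistent with each other: 486 codons plus one stop codon give 486 x 3 + 3 = 1461 bp. The following is a minimal Python sketch of that check and of translating a coding sequence, assuming the standard genetic code and using "-" for the stop codon as the protein record does. The function names translate and expected_cds_length are hypothetical, and the sketch is not the pipeline that produced this geneset.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (third base fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate a coding sequence in frame 0; unknown codons become 'X'."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        protein.append(stop_symbol if aa == "*" else aa)
    return "".join(protein)


def expected_cds_length(protein_length: int) -> int:
    """Length in bp of a CDS encoding protein_length residues plus a stop."""
    return protein_length * 3 + 3


# First and last codons of DPOGS214241-TA, copied from the record above.
print(translate("ATGTGGTTGGGTTTAAACGAA"))
print(translate("CGGAAGTCTAGGTAG"))
print(expected_cds_length(486))
```

Applying translate to the full DPOGS214241-TA sequence and comparing the result with DPOGS214241-PA would be one way to confirm that the two records correspond.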