Monarch geneset OGS2.0

DPOGS203856
TranscriptDPOGS203856-TA1554 bp
ProteinDPOGS203856-PA517 aa
Genomic positionDPSCF300010 + 3423726-3426183
RNAseq coverage382x (Rank: top 31%)
Annotation
HeliconiusHMEL0086612e-3265.32% 
BombyxBGIBMGA007599-TA2e-10953.17% 
Drosophilabif-PB1e-0852.38% 
EBI UniRef50UniRef50_D6WCQ91e-1533.87%Bifocal n=1 Tax=Tribolium castaneum RepID=D6WCQ9_TRICA
NCBI RefSeqXP_392295.21e-2034.70%PREDICTED: similar to bifocal CG1822-PB, isoform B [Apis mellifera]
NCBI nr blastpgi|3287772592e-1934.70%PREDICTED: hypothetical protein LOC408761 isoform 1 [Apis mellifera]
NCBI nr blastxgi|2700028738e-2526.07%bifocal [Tribolium castaneum]
Group
KEGG pathway 
Orthology groupMCL30509 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203856-TA
ATGATGGTGGAGACCATTCTCGAGTCCTCACAGGACTTTGGGATGTCGAACGGCACTCTACCACAATGGAAGCGCGAGTTGCTGCAGCGCCGTGCCGCCCGGGCCCGGCCCCCCCTGGCGCCGGTTCCTCCCGCGCCCCCGCCGCCCGTCCCGCCCGCCGCAGACGACGACGAGGAGCTCCGCTACGGGCCCGGCATCGTCAAGCGTCTCAAGTCCAGATACCTCAGCCTCGCCCTCCGAGAGGCGCCGCGCCGACGACCCTCTGTCCTCCGCCGCGCCGCATCTCTCGAACACCTCCTGGACGAGCGTCCTCCGCCCGCCCCTCGCCAGCACGCTCGGCCCGCCCGCCCGGTGTCCATGGCGGTCCCCAGCTCCGGTCCGCACTCCGCCCCCGTCCCTCGACGTGAATCCGTGAAGCGCGCGCGGTCCGTGGACGCCCTCAGCCGCCTCGACTCCCGGGACGAGTCTCCTCATGTTCAGCTACAGTCGTCCCTCCAGCCGCCTCCTCGGCCACCGCCGCCGCCGCCCTCACCGCCGCCCCTCACGCCGAGGGCCACCCGACCTCCTCGCCGCCCGGCGCCTCTGCTCCGGGAGGCCGAGCGCCCCCCGGCTGACCTGGTCCGGTCGACCCTGAGAAAGTTCGAGTCCGCCCCTCCGCGCCGCACGGCCCCGGCGGCTCGGGTGTCTGCCGTGCTGCGGGGCCTCGAATCCCTGCCGCGAAGCTCGACCCCGGAGCCCCGAGGACGTTCGCCGAGCCCGGCCGCCACGGACCTCTCCGCCATAGACGAGCCGGAGACGTCGACGACGCCTGCCCCCAGCGAGACCAAGCAGGTGTCCAAGCGGGCTCTGGAAGGCATCGCACGGGCGGGGTCGTCCGTGCTCTACTCGTTCACGTCGGGAGGCACGGGCTCGCACTTACCGCCGCTCGCCGCCTGCAATACAGTGCTGGTGGGTCGAGCGAGACGAGTGGGCGTCATCAGGCCCATGCCGGCCCAGACCCTCGAGCCGGCCGAGGACGAGACGGACGACAAGCCGGAGGACGTGCGGATCGTAATATCGCCGCCGCCCTTAGAGGACGACTCGCAGAATAAGGAGCGGACGACCTCCGCCCCGCACGACCCTCGACCGACAACACCGACAGAGGAAAGTGTTCCGACGACTCCCGACCCCGCCGTGGCGCAGGACGCGGACGACGCGAGGCCCGCCCCGGCACCTCAGACAGAGACGCCAAAGATACCTCCTCTGAAACCGACGATTAATGGCCACGCTACCTCCGCGAAATCTGACGAATCTAAAGTCGACAGATCCTTCTCCAAGATAGGGGCGGCCGGGATAGAGCGAGCCTGGACCGGCGACCTCGAGAAGAAACAGAAAGGAGACGGCGAGGTGAAGAGTCCCTGGGGCGCCACCCGAGCCCCGCGGCCCCCCGCGCCCGCCGCCACCTCCGTGGTGTTCAACTTCTCCAACCGGAAGGAAGTCCCCGACTACGTCGAGAACGACGGCACCATCAGGAGAGTCAACAGGAGGAACATATTCAAGGTCAGTATTTTATAG

Protein sequence:

>DPOGS203856-PA
MMVETILESSQDFGMSNGTLPQWKRELLQRRAARARPPLAPVPPAPPPPVPPAADDDEELRYGPGIVKRLKSRYLSLALREAPRRRPSVLRRAASLEHLLDERPPPAPRQHARPARPVSMAVPSSGPHSAPVPRRESVKRARSVDALSRLDSRDESPHVQLQSSLQPPPRPPPPPPSPPPLTPRATRPPRRPAPLLREAERPPADLVRSTLRKFESAPPRRTAPAARVSAVLRGLESLPRSSTPEPRGRSPSPAATDLSAIDEPETSTTPAPSETKQVSKRALEGIARAGSSVLYSFTSGGTGSHLPPLAACNTVLVGRARRVGVIRPMPAQTLEPAEDETDDKPEDVRIVISPPPLEDDSQNKERTTSAPHDPRPTTPTEESVPTTPDPAVAQDADDARPAPAPQTETPKIPPLKPTINGHATSAKSDESKVDRSFSKIGAAGIERAWTGDLEKKQKGDGEVKSPWGATRAPRPPAPAATSVVFNFSNRKEVPDYVENDGTIRRVNRRNIFKVSIL-