Monarch geneset OGS2.0

DPOGS208094
TranscriptDPOGS208094-TA1527 bp
ProteinDPOGS208094-PA508 aa
Genomic positionDPSCF300395 - 60727-62888
RNAseq coverage670x (Rank: top 19%)
Annotation
HeliconiusHMEL0160811e-12445.28% 
BombyxBGIBMGA001814-TA6e-4042.34% 
DrosophilaCG5003-PA8e-1436.27% 
EBI UniRef50UniRef50_UPI00022CA3781e-2047.96%UPI00022CA378 related cluster n=2 Tax=unknown RepID=UPI00022CA378
NCBI RefSeqXP_395578.33e-2146.94%PREDICTED: similar to CG3731-PB, isoform B [Apis mellifera]
NCBI nr blastpgi|3320233734e-2048.98%F-box/LRR-repeat protein 12 [Acromyrmex echinatior]
NCBI nr blastxgi|3234540016e-2430.74%hypothetical protein AURANDRAFT_71291, partial [Aureococcus anophagefferens]
Group
KEGG pathway 
Orthology groupMCL26678 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208094-TA
ATGACCATCGACAAGTCCTTCCTCTTCAAGATACAGGACTACCTGCCGGCGCTCGAGAGCCTGGACGTGTCGGAGTGCGAGTGGATGGACCCGGCGACGCTCCTGCCGCTCAGCAAGCTGCCGGCCCTCCAGGAGCTGTTCCTCAGGGACTGCCACAAGCTGGCGGAGTTCGTGGCGTACGCCTCCCTCACCGCCAGATACGGGTTCAGGACACTGAAGGTGTTGGACCTGCGAGGGTCTCCGGTCGGGGACTCGGAGGTGTCGGCGCTGGGCTGGCTGCCGCAGCTGGAGCGTCTGTGGCTGGCCGCGGCGCGCGTCGAGAACAGGAGCCCGCACGCTCACTACGCAGACGACGCAGAGACCTGGCACACGGAGCTGGAGCACTGGGAGACGGCGGAGCGGGAGTACTTCAAGTCTAGACCAAGCAACGGAACACTCGAAGAAACGACGCCAGGCTGTAAGTATGACGGCGAGAAGGAGGCAGCGCGCCAGGAACAGAGAACAGACAGAGCCACATGTGACGACAGAGGGGAGAAGAAAGAGAACAAGAAGAGAAAAACGAGCGACGAGATGAACGGCGGGGAGGGAGGAAAGAGGGCGCGGGTGGGGGGAGGGGCGGAGGCGAACGGAGAGAGGGCGGGCAGAGACGGCAGAGACCGGGAGAACAACAGACACAGGAACAGCCCGGACTGTATCGTGGACTGTATTAAGATAGACCAGTTGATAGACGTGGGCGAGCTGAGGAGGGGCCGGCAGAACGAGGTCATCGTCATAGCCAACTCCGCCAAGGTCAGGAGTCCCACCAGGCCTGCGCCCCACGACCGGTACGTGTGCCTCAGGAGGAGCCGGGACCGGGATGGAGGGCAGGGGGAGGGAGGGGAGGGGAGCGGCAGCGTCAACCAGGGGGCGAGCACCTCCAGGGATCACAAGGATGGAGACGACACGGACAACGAGGACGACGCCAGCGACAACGGCAACGAGAAGTACGTGCACTTCAGGCAGAACCGCGAGCCGGTGTTCGTGGACTGCGCGCCGCCCGAGCCGCCGCCGCCCGTGCAGTACCAGATCGAGCCGCGGCACCACGTGCTGTACGTGAGCGTGGGCCCGCAGGTCAACACGTACCGCTTCCCGCGCGACTCGGCGGACCTGGAGCGCCAGGTGTGCCTCAGCCTGGCGCCCGCGCACGTGGACTCCAGCTCGCTGGTCACGGACTTCGCGATCCGGCGCTTCGGGCGCGCGGACGGCGAGGACGTCAACATCATCCACATAGGACCCAACGGGCCCATGCTGGTGGGGCAGAGCGCCGGCTCGCGGCCCGACCGCTCCAACCTGCGCCTGCTGTCCGTCACGGGCTACCGGAACATCACGGACCGCAGCCTCTCGCACCTGGCCTCCGCCGCGCCGCGCCTCGCGTCGCTGGACTTCCGCGAGACCAACGTCAGCGAGGCCGGCGCCAGGAGCTTCCTCGCCCTGCGCCCCGACTGCGAGCTGGTCTACAGCGCCTTCGTCGACGACAAGGACAACTAG

Protein sequence:

>DPOGS208094-PA
MTIDKSFLFKIQDYLPALESLDVSECEWMDPATLLPLSKLPALQELFLRDCHKLAEFVAYASLTARYGFRTLKVLDLRGSPVGDSEVSALGWLPQLERLWLAAARVENRSPHAHYADDAETWHTELEHWETAEREYFKSRPSNGTLEETTPGCKYDGEKEAARQEQRTDRATCDDRGEKKENKKRKTSDEMNGGEGGKRARVGGGAEANGERAGRDGRDRENNRHRNSPDCIVDCIKIDQLIDVGELRRGRQNEVIVIANSAKVRSPTRPAPHDRYVCLRRSRDRDGGQGEGGEGSGSVNQGASTSRDHKDGDDTDNEDDASDNGNEKYVHFRQNREPVFVDCAPPEPPPPVQYQIEPRHHVLYVSVGPQVNTYRFPRDSADLERQVCLSLAPAHVDSSSLVTDFAIRRFGRADGEDVNIIHIGPNGPMLVGQSAGSRPDRSNLRLLSVTGYRNITDRSLSHLASAAPRLASLDFRETNVSEAGARSFLALRPDCELVYSAFVDDKDN-