Monarch geneset OGS2.0

DPOGS215892
TranscriptDPOGS215892-TA1680 bp
ProteinDPOGS215892-PA559 aa
Genomic positionDPSCF300029 + 198657-213144
RNAseq coverage26x (Rank: top 77%)
Annotation
HeliconiusHMEL0057382e-4671.76% 
BombyxBGIBMGA000248-TA2e-4566.92% 
DrosophilaCG34461-PA5e-3457.69% 
EBI UniRef50UniRef50_UPI000175819B4e-3848.91%UPI000175819B related cluster n=1 Tax=unknown RepID=UPI000175819B
NCBI RefSeqXP_001237794.25e-5450.41%cuticular protein 139, RR-1 family (AGAP006283-PA) [Anopheles gambiae str. PEST]
NCBI nr blastpgi|1582956749e-5350.41%cuticular protein 139, RR-1 family (AGAP006283-PA) [Anopheles gambiae str. PEST]
NCBI nr blastxgi|1582956741e-5345.05%cuticular protein 139, RR-1 family (AGAP006283-PA) [Anopheles gambiae str. PEST]
Group
Gene OntologyGO:00423027.9e-18structural constituent of cuticle
KEGG pathway 
InterPro domain[59-111] IPR0006187.9e-18Insect cuticle protein
Orthology groupMCL17940 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215892-TA
ATGGATCTCAAAGTTATAAAAATTGAATGTTTTTGGCAGATCCTTTTGCTTCTGTCCTGTTTGAGTGTGGTGCTGGCTGTTCCAGTACAATTGGGAGGATATGGAGAAGTTTATTCTCAGGCCCCCCTCATTCATGCTCCAGTAGTCCATGCTGAGCCAGTTTCGTACCCAAAATATACTTTCAACTACGGAGTCAAGGACCCTCACACCGGAGACATTAAAAGTCAAGAGGAGCAGAGGGACGGTGATGTCGTTAAAGGTAGCTACTCGTTAGTCGAAGCTGACGGCACCACTCGCACTGTTCATTATACAGCTGACGATCACACTGGATTTAACGCCATAGTACAGAGATCAGGCCACGCTGTACATCCAGTACATGCTCCAGTGGCTCATTATGCCCCTGCGCCTGTGGTTTTGTTTACAACTTTGTTATGGACTCCAGCAGCACCAGCTCCTCGGCAGAAGAGGCTTAGTTTAGCGGCTCATTCTCCAAGTTACTATTATACAGATGTGGCAGGTCATCCAGGAACGTACTCTTTTGGTTATGACGTATTAGATCCAGAGACTGGCAACACTCAATTTCGTTCTGAGGAGAAGTATCCAAATGGAACAGTCGTCGGTAGCTACGGATATATTGATCCCAAAGGGAGATCACGCCGTTTTAACTATATTGCTGATGAATATGGATACAGATTAATGAACGACGTACCAAAAAAGCAAAACTATCAAAGCACTCCTGAAAACATTAATACGTCAACGGAGGCATCCATCACTTGGAGCCGACCCAGGAAACCAAATAAAAAGAAACAACAGTCAAAGTCGGAAAAACCTTCGAGTTATCCAATGAAAGGACAAGAAGAGAAAACGCATGTACAGAAGAACGAGAAGAAATTTCCTAACGGTACTATTGTGGGAAAATACTCTTATAGAGATAAAGATGGCAATCCAATACACGTTAGATATTACGCTGATGGTTCAAGTTATGGTGTCGAACTGAAGAGTGTTAAAGTTTTTGGATCACAACCAGACAATTTAAAGGAGGCGATTAATTTGGATGTGCCATCAGAACCGTATGAGGAGGCCATAAACAGGGCAAATTCATTGCTTAGTAACAGTTTTGTAAAAAATAAGTCCTATACACCATTTCAAGTTATTAACGCAGTTCCCGAAGAAACTTCCAAACCTAAAAAGCAGAATGATGATTACGAGATATTTTTAGAAAATGACATTAGGCCTTCACAAGATTGCAACAAGGAAAAAATCCGAATTTACACAGATAAGTCCAAGAGGAAAGGCTTACTCATAGCATCAGCGGTTATAGCCTGTGCAAATGCCCGGTTATTCACGTATTTAAGACCAGCGGAACCGCATACAGCGGCCAGTTTTGTGAACGCAGAACCAAAAATTGAATATGCACTCCAACACATACCAGAAGAAGAGCACATTGATTATTATGCGTATCCTAAATACGTGTTTAAGTACGGAGTGAATGACTTTCACACTGGAGACATAAAGACTCATCACGAGAGCAGAGATGGCGATGTCGTCAAAGGTCAGTATACGGTTGTAGAGCCCGATGGTTCTATCAGGACAGTCGATTACACGGCCGATAACCACAACGGTTTCAACGCGGTGGTACATAAGACAGCTCCCATTTCCGCCCACGAAGCCCATCTTTAA

Protein sequence:

>DPOGS215892-PA
MDLKVIKIECFWQILLLLSCLSVVLAVPVQLGGYGEVYSQAPLIHAPVVHAEPVSYPKYTFNYGVKDPHTGDIKSQEEQRDGDVVKGSYSLVEADGTTRTVHYTADDHTGFNAIVQRSGHAVHPVHAPVAHYAPAPVVLFTTLLWTPAAPAPRQKRLSLAAHSPSYYYTDVAGHPGTYSFGYDVLDPETGNTQFRSEEKYPNGTVVGSYGYIDPKGRSRRFNYIADEYGYRLMNDVPKKQNYQSTPENINTSTEASITWSRPRKPNKKKQQSKSEKPSSYPMKGQEEKTHVQKNEKKFPNGTIVGKYSYRDKDGNPIHVRYYADGSSYGVELKSVKVFGSQPDNLKEAINLDVPSEPYEEAINRANSLLSNSFVKNKSYTPFQVINAVPEETSKPKKQNDDYEIFLENDIRPSQDCNKEKIRIYTDKSKRKGLLIASAVIACANARLFTYLRPAEPHTAASFVNAEPKIEYALQHIPEEEHIDYYAYPKYVFKYGVNDFHTGDIKTHHESRDGDVVKGQYTVVEPDGSIRTVDYTADNHNGFNAVVHKTAPISAHEAHL-