Monarch geneset OGS2.0

DPOGS208156
TranscriptDPOGS208156-TA1389 bp
ProteinDPOGS208156-PA462 aa
Genomic positionDPSCF300058 + 43835-50658
RNAseq coverage1108x (Rank: top 11%)
Annotation
HeliconiusHMEL0110805e-15664.39% 
BombyxBGIBMGA013773-TA4e-5262.13% 
Drosophilap47-PA7e-5242.47% 
EBI UniRef50UniRef50_E2AKL71e-6443.88%NSFL1 cofactor p47 n=8 Tax=Formicidae RepID=E2AKL7_CAMFO
NCBI RefSeqXP_393054.25e-6745.71%PREDICTED: similar to p47 protein isoform a [Apis mellifera]
NCBI nr blastpgi|3504005685e-6646.07%PREDICTED: NSFL1 cofactor p47-like [Bombus impatiens]
NCBI nr blastxgi|1565542368e-6836.66%PREDICTED: NSFL1 cofactor p47-like [Nasonia vitripennis]
Group
Gene OntologyGO:00055159.8e-15protein binding
KEGG pathwayame:4095471e-66 
 K14012 (SHP1, UBX1, NSFL1C)maps-> Protein processing in endoplasmic reticulum
InterPro domain[260-358] IPR0129894.7e-24SEP domain
[385-459] IPR0010129.8e-15UBX
[1-46] IPR0090605.6e-14UBA-like
Orthology groupMCL12576 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208156-TA
ATGTCTGGGAATAAGGAAGACACGCTGCGACATTTCTGTGACGTTACCGGCGCCGATGAAAATCGAAGTAGATTCTTTTTAGAATCATCAAACTGGCAATTAGAGGTTGCTCTCTCAAGTTTTTACGAACACGGTGGTCATATAGAAGAAGCGCCTAGTGCTTCTCCTGCGGTTGGCGGTGTACAGCCTATGTCGGATAGTGATATGGACTCACCTCCACGGTCACCAGTGCAGGCCAAAAAAAAAGATAAAAAGAAATCCAACCCTCACTTCGCGACTCTGGATTCGCTGCAACAAGAAAGTTCTAGTGAAGATGAAGAGATATCAGTGTGTGGACGGAATAATAGTGTGAGAGTAGAAATTGTTCAAGAATTCTTTTGTCATCCACCATCATGTTCTAAAAAGAAAGAGGATTGTGATAAATGTAACTCACAAAGAGATCAATTAAATGAAATGCTAGAAGAGATACAGAGACTTAATAATGAAATTAATAGACTGTTGGTAGAAAATCTTGAATTGAGCAAGGAAATATCACAAATGAGAGAAAATTCAGCTCAAGCTTTCTACGCTGGAGGTTCTGAGAGGTCCGGACAACAGATTCTTGGACCGGGGAAAGGCAGGAAAGACATAATAACGGAAATGTTCAAAAGTGTCAGGGAACAGGGTGCAGTTGTTTTCGACGAGGAACAATCGTCGACTAGTAGGGGGCGCTCGGGGCTATTCGGTGGTGTAGGGTACAGGCTAGGTCAGACAACTGATGATCATGAACAGGTCACACCTGGAAGTGCTGCACAGCAACAGGAGGGTCCTCGCGTGGTCCGCCTGCGTCTATACCGCGCCGGGTTCGTGGTGGACGACGGACCCCTGCGGCTGTACTCGGATCCTGAACACGCTCATTTCCTCAGCTGTATACGACGAGGCACAATACCTCCCGAGCTCCGCGGACACGGAGAGGTGAAGCTCAGTCTAGAGGACAAGCGACATGAAGAGTGTCCTCGCCCCGCCTCCAAACACATGCCCTTTGGGGGCAAAGGTCACATGTTAGGCAGCCCCACTCCGCCGACGGTGGGCGCCACGTGTCCGGTGGCGGCGAGCGGCGGAGACCGAGAGGCGAACATGCGAGCGGCTCAGAACGACGTGCAGCTAGACGACACCGCTCCCGTCACCTCCGTACAGTTCCGCCTGGCGGACGGCAGTCGCCTGACGGGTCGCTTCAACCACTCTCACACCGTGGCGGACCTCGCTCGGTTCGTGGCTCGCGCGGAACCGACGTACCAGCTGACGTCGTTCGCCTTGCTGGCCGCCTTCCCCAGGGTGGAGCTGGACGAGAGCCAGACCCTGGCGCAGGCGGACCTGTTGAACGCCACCGTGCTGCAGAGACTCAAATGA

Protein sequence:

>DPOGS208156-PA
MSGNKEDTLRHFCDVTGADENRSRFFLESSNWQLEVALSSFYEHGGHIEEAPSASPAVGGVQPMSDSDMDSPPRSPVQAKKKDKKKSNPHFATLDSLQQESSSEDEEISVCGRNNSVRVEIVQEFFCHPPSCSKKKEDCDKCNSQRDQLNEMLEEIQRLNNEINRLLVENLELSKEISQMRENSAQAFYAGGSERSGQQILGPGKGRKDIITEMFKSVREQGAVVFDEEQSSTSRGRSGLFGGVGYRLGQTTDDHEQVTPGSAAQQQEGPRVVRLRLYRAGFVVDDGPLRLYSDPEHAHFLSCIRRGTIPPELRGHGEVKLSLEDKRHEECPRPASKHMPFGGKGHMLGSPTPPTVGATCPVAASGGDREANMRAAQNDVQLDDTAPVTSVQFRLADGSRLTGRFNHSHTVADLARFVARAEPTYQLTSFALLAAFPRVELDESQTLAQADLLNATVLQRLK-