Monarch geneset OGS2.0

DPOGS214715
Transcript: DPOGS214715-TA, 2178 bp
Protein: DPOGS214715-PA, 725 aa
Genomic position: DPSCF300022 - 222184-229263
RNAseq coverage: 389x (Rank: top 31%)
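The lengths listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only and rests on two assumptions not stated in the record: that the transcript length covers the coding sequence including one stop codon, and that the genomic coordinates are 1-based and inclusive. The variable names are hypothetical.

```python
# Values copied from the record above.
transcript_bp = 2178
protein_aa = 725
genomic_start, genomic_end = 222184, 229263

# Assumption: the transcript is coding sequence only, ending in one stop codon.
codon_count, remainder = divmod(transcript_bp, 3)
lengths_consistent = remainder == 0 and codon_count == protein_aa + 1

# Assumption: coordinates are 1-based and inclusive.
genomic_span = genomic_end - genomic_start + 1

print(f"codons: {codon_count}, consistent with protein length: {lengths_consistent}")
print(f"genomic span: {genomic_span} bp")
```

Under these assumptions the transcript holds 726 codons (725 residues plus a stop) and the locus spans 7080 bp on the scaffold.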
Annotation
Heliconius: HMEL008092; 0.0; 92.00%
Bombyx: BGIBMGA008387-TA; 9e-34; 26.40%
Drosophila: Cpsf100-PA; 0.0; 55.95%
EBI UniRef50: UniRef50_B4JTB6; 0.0; 56.10%; GH10247 n=2 Tax=Drosophila RepID=B4JTB6_DROGR
NCBI RefSeq: XP_394940.2; 0.0; 58.97%; PREDICTED: similar to Probable cleavage and polyadenylation specificity factor, 100 kDa subunit (CPSF 100 kDa subunit) [Apis mellifera]
NCBI nr blastp: gi|322783252; 0.0; 59.40%; hypothetical protein SINV_80021 [Solenopsis invicta]
NCBI nr blastx: gi|383852782; 0.0; 58.97%; PREDICTED: probable cleavage and polyadenylation specificity factor subunit 2-like [Megachile rotundata]
Group
KEGG pathway 
InterPro domain: [243-368] IPR022712; 1.6e-23; Beta-Casp domain
Orthology group: MCL12736; Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214715-TA
ATGACTTCTATTATTAAATTCCATTGCCTCTCAGGGGCTGGAGACGAGTCTCCTCCCTGCTACGTGTTGCAAGTGGATGAATTTAAATTCCTCTTGGACTGTGGATGGGATGAAAAATTTGATATGGATTTTATAAAGGAACTTAAAAGACATGTCAACTCTATAGATGCAGTCCTACTGTCACATTCAGATCCCCTTCATCTCGGGGCCCTACCATATGCTGTCGGACAGCTCGGTTTAAACTGTCCTATATATGCCACCCTCCCAATATACAAGATGGGCCAAATGTTCATGTATGATCTCTACCAATCACATAAAAATGTCTCCGAGTTTGATCTGTTCACATTAGATGATGTGGACACAGCATTTGATAGAATCACACAACTTAAATATAATCAGAGTGTTGATATGAAGGGTAAAGGGCTAGGCCTGCGTATAACTCCACTGCCAGCCGGACACCTCCTGGGCGGAACTGTGTGGCGTATTGCAGCCCCAGGGGAAGAAGACATAGTGTACGCACCAGACTTCAACCACAAAAAGGAGCGGCATCTGAATGGGTGCGAGATTGAGAAGATTATGAGGCCTTCATTACTGCTGCTCGGAGCTATGAATGCTGATTACGTGCAGCAGAGACGGCGGCTAAGGGACGAAAAACTTATGACAACAATCCTTAGTACACTTCGGGGTGGTGGTTCAGTACTGGTGTGTACGGACACCGCGGGACGGGTTCTAGAGCTGGCCCATATGTTGGACCAACTTTGGAGGAACAAGGATTCTGGTCTTGTTGCATATTCTCTGTTGTTGTTGTCCAACGTCAGCTATAATGTTGTGGAGTTTGCCAAGTCACAGATCGAATGGATGAGCGACAAATTGACCCGCGCCTTCGAAGGAGCTAGAAGCAACCCTTTCGCGCTGAGGCACTTGCAACTGTGTCACTCCGTAGTCGAGGTCACTCGGACCCCGGGGCCCAAAGTGGTGCTGGCGTCCTTCCCAGACTTAGAGACCGGTTTCGCAAGAGATCTTTTCCTGCAATGGGCCCCTAATTCACAGAATTCTATAGTACTAACTGCAAGGACCTCTCCGGGGACCCTCGCCAGGGATCTGATTGAGAAAGGCGGTGACCGCACCATAGAATTGACGGTGAGGAGGCGGGTCCGGCTGGAGGGGGCGGAGCTTGAGGAGTTCATGCAACAGAGGGTCAAGGTCAACAACTCGGTCAAAGAGGAGACCGGTGGTATATCATCCGACTCCGAGTCCGAGGGTGAGTTGGAGATGTGCGTGGTGACCGGCAAACACGACATACCGGTCCGGGGGGACGCCAGGCCCGCGGGGTGCTTCAAGAGCAACAAAAGACACCACGCCATGTACCCCTGTACCGAGGAAAGAGCGAGGGCCGACGACTACGGAGAGATTATACGGCCTGAAGACTACCGCCTGGCGGAGGTCGTGGACGCCGAGGGAGAGATTCGGGACGTGCCGCCCGCCCCGACACACACACAGGAACCGGAAGAGGAGATAACAGAGATCCCGAGTAAGTGTATCACGGCGACCAAGCAGCTGCAGGTGAAGGCCAGCATCCAGTACATAGAACTGGAGGGCCGCTGTGACGGAGAGTCACTGCTGCGAGTGGTGGCGGCCGCCAAACCTCGGGCGGTGGTGGCCCTGAGAGCCGGACCTACGGCACTGGCCACCCTCAAAAAGCACTGTGACAGTGAGGGTATCGAGAAAGTCTTCACACCGGGCCGCGGCGACACAGTGGATGCGACCACGGAGTCTCATATCTACCAGGTGAAGTTAACGGACAGTGTGATGTGCGGTTTGTCCTGGCGCTCGGCCGGGGACGCGGAGCTGGCGTGGCTGTCGGCCGTGGTGGCGCAGCCGAGGACCCGGGACACGCCCAGCGAGGAAGTGGCGGATGTGGAGATGATGTCGCTGGAGGCTGCGGAGGGCGTGCCTCACGGCGCGTGGTTCGTGAACAGTGTGAGGCTCTCGGAGCTGAG
GGCGGCGCTCGCCCGGAACGGCCTCGGGGCGGAGTTCAGTGCCGGGGCCCTGGAGTGCTGCAACGGAACCATCGCTATACGAAGATTGGAGAACGGTCGCGTCGCCCTCGAGGGAGTGCTCTCTGAGGAGTATTTCAAAGTGCGGGAACTTTTGTACGACCAGTTCGCTATAGTGTAG

Protein sequence:

>DPOGS214715-PA
MTSIIKFHCLSGAGDESPPCYVLQVDEFKFLLDCGWDEKFDMDFIKELKRHVNSIDAVLLSHSDPLHLGALPYAVGQLGLNCPIYATLPIYKMGQMFMYDLYQSHKNVSEFDLFTLDDVDTAFDRITQLKYNQSVDMKGKGLGLRITPLPAGHLLGGTVWRIAAPGEEDIVYAPDFNHKKERHLNGCEIEKIMRPSLLLLGAMNADYVQQRRRLRDEKLMTTILSTLRGGGSVLVCTDTAGRVLELAHMLDQLWRNKDSGLVAYSLLLLSNVSYNVVEFAKSQIEWMSDKLTRAFEGARSNPFALRHLQLCHSVVEVTRTPGPKVVLASFPDLETGFARDLFLQWAPNSQNSIVLTARTSPGTLARDLIEKGGDRTIELTVRRRVRLEGAELEEFMQQRVKVNNSVKEETGGISSDSESEGELEMCVVTGKHDIPVRGDARPAGCFKSNKRHHAMYPCTEERARADDYGEIIRPEDYRLAEVVDAEGEIRDVPPAPTHTQEPEEEITEIPSKCITATKQLQVKASIQYIELEGRCDGESLLRVVAAAKPRAVVALRAGPTALATLKKHCDSEGIEKVFTPGRGDTVDATTESHIYQVKLTDSVMCGLSWRSAGDAELAWLSAVVAQPRTRDTPSEEVADVEMMSLEAAEGVPHGAWFVNSVRLSELRAALARNGLGAEFSAGALECCNGTIAIRRLENGRVALEGVLSEEYFKVRELLYDQFAIV-
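
The protein entry above appears to be the conceptual translation of the nucleotide entry, with the trailing "-" marking the stop codon. A minimal Python sketch of such a translation follows, assuming the standard genetic code; the function name `translate` and the `stop_symbol` parameter are hypothetical, chosen here for illustration.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def translate(nucleotides: str, stop_symbol: str = "-") -> str:
    """Translate a coding sequence codon by codon, starting at position 0.

    Stop codons are written as `stop_symbol` (this record uses "-").
    Codons with ambiguous bases become "X"; a trailing partial codon is ignored.
    """
    seq = nucleotides.upper().replace("\n", "").replace(" ", "")
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First nine codons and last four codons of DPOGS214715-TA.
print(translate("ATGACTTCTATTATTAAATTCCATTGC"))  # MTSIIKFHC
print(translate("GCTATAGTGTAG"))                 # AIV-
```

The two printed fragments match the start and end of DPOGS214715-PA. Because the function strips newlines before translating, the wrapped nucleotide lines can be passed in as they appear in the record.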