Monarch geneset OGS2.0

DPOGS206094
TranscriptDPOGS206094-TA1758 bp
ProteinDPOGS206094-PA585 aa
Genomic positionDPSCF300028 + 137158-141768
RNAseq coverage266x (Rank: top 40%)
Annotation
HeliconiusHMEL0121073e-15869.18% 
BombyxBGIBMGA006864-TA2e-11770.76% 
DrosophilaCG9426-PA7e-15649.91% 
EBI UniRef50UniRef50_Q9VK211e-15349.91%CG9426 n=19 Tax=Endopterygota RepID=Q9VK21_DROME
NCBI RefSeqXP_001657135.12e-16349.04%actin-binding protein ipp [Aedes aegypti]
NCBI nr blastpgi|1571376935e-16249.04%actin-binding protein ipp [Aedes aegypti]
NCBI nr blastxgi|1571376933e-15749.04%actin-binding protein ipp [Aedes aegypti]
Group
Gene OntologyGO:00055151.6e-26protein binding
KEGG pathway 
InterPro domain[1-586] IPR0170961.7e-167Kelch-like protein, gigaxonin
[374-585] IPR0159164.5e-55Galactose oxidase, beta-propeller
[18-137] IPR0113337.7e-34BTB/POZ fold
[143-245] IPR0117052.1e-31BTB/Kelch-associated
[31-136] IPR0130691.6e-26BTB/POZ
[41-138] IPR0002105.3e-23BTB/POZ-like
[423-469] IPR0066527.5e-12Kelch repeat type 1
Orthology groupMCL15049 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206094-TA
ATGTCTTCTAGTATTTATGATAAAATAATTAATAAAGAGGGCTCAAGGCCTTATAAATGTTGCGAATATGCAAGCAAAGTTTCTTTAAACCTTAATCACTTCAGACGAGATGGAAGATTCTGTGACATAGACCTTATATCCGGAAAAACAATAATTAGGGCCCATCGAGTTGTATTAGCAGCGAGTTGCGAGTACTTTGATGCTATGTTTAATGAAGGCTTTGAAGAAAGTCAGAAAGGTAGAGTTGTGCTGCCTACTGTACCTCCAGGAATACTTCCCATGATTATTGACTTTATTTATACTGGAGAGATTTCTATAGATAAAGCTAGTGTGCAGCACTTACTTATCGCAGCTGATATGTTTCAATTACGAGAATTAGTTAGAGGATGTGGAGACTTTTTGAAAAGGGAACTACATCCATCAAACGCTCTTGGTATATTTAGGTTTGCAGAGACACACAATTGCACTGAATTAGCTGAAGAAGCACTGGGCCATGCACAGGCCAATTGGAATCTTGTTGCAAATGGGGATGAGTTACTGGAGTTACCTCTGCAACAACTGATTACTTTATTATCATCGGAGCAGCTTGAAGTCCATAATGAAGCTCAGGTTCTCCATCCTGCATTGAAATGGCTTGAACATGATCCCGCAACACGCAGAAGGCACTGTTTTGAAGTTCTCAGACATGTTAGACTGCCACTTATATCACCACAAATCTTGGATGATGTCATTAAGAATGTGCAAGATCCATCCATAGCAGTGGCTTTAAAAAACGTTAGAGTTGATATGAAATCAGGTCGTGGTGCTCTGGTATGTTTATCAGCTGAACCCCGTGCTCGGGCTCGTCGTATGCTGGTGGTGGCCGGCGGGTCCTGTCACGACGCGGCCCCACATCCACCACATTCAACTGATAATATACTGTCATCAGCGCTCAAGTTTGATCTCCATAAGAGGGAGTGGGAGGAACTATCTCCTATGGGAATAGCTCGTATACAACCGGGAGTAGCGAGCCTGGGTGGAAGGGTGTACGCCGTTGGCGGAGAACAAGGCAGCCAGATATTAGCCAACGGAGAGGTGTATGATCCACAGACTGATAAATGGTCATACATTGCATGTATGAAAGAGGCTCGCTGTGAGTTCGGTCTAACAGCTTGGAAAGGCAACTTGTACGCTTTCGGTGGCTGGGTGGGATCAGAAATGGGCGCATCCGTCGAGGTCTATGATCCCGTATCTGACGAATGGACACTCATAGACAGGATGCCTGAACCGAGATTTGGGATGGGCGTTGTTAATTTTGAAGGGTTAATCTATGTAGTGGGTGGTTGTACCCACACATGGCGTCACACCCGGGATCTTCTCTGCTATCATCCTGCGTCTCGCAAGTGGCGCCCCCTGGCTCCCATGCGTCACGCCCGCTCGCAGGCTGCGGCCGTTGTACTGGGAGCGCATCTTTACGTCATAGGAGGAAACGCACCGAGACGGACCGTACTATCATCAGTGGAAAGATACAGCTTCGACGACGATTCATGGGAAGAAGTTGGTAGCCTGGTGGAGGCGAGGGCGGGGTGTGCTGCGGGCGCGGCGGATGGGCTGCTCGTGGCGGCCGGCGGGGACAGCGAGTGCGGCGGCAAGAGGGACTTCTACCGCGCCCGCACCACGCTCGCCTCCGTAGAAATATACGACCCGACACGAGATACGTGGACGTCCACGACATCTCTGCCTCATTCACGAGCTGAAGCTGGTGCCGCCTTACTCTGA

Protein sequence:

>DPOGS206094-PA
MSSSIYDKIINKEGSRPYKCCEYASKVSLNLNHFRRDGRFCDIDLISGKTIIRAHRVVLAASCEYFDAMFNEGFEESQKGRVVLPTVPPGILPMIIDFIYTGEISIDKASVQHLLIAADMFQLRELVRGCGDFLKRELHPSNALGIFRFAETHNCTELAEEALGHAQANWNLVANGDELLELPLQQLITLLSSEQLEVHNEAQVLHPALKWLEHDPATRRRHCFEVLRHVRLPLISPQILDDVIKNVQDPSIAVALKNVRVDMKSGRGALVCLSAEPRARARRMLVVAGGSCHDAAPHPPHSTDNILSSALKFDLHKREWEELSPMGIARIQPGVASLGGRVYAVGGEQGSQILANGEVYDPQTDKWSYIACMKEARCEFGLTAWKGNLYAFGGWVGSEMGASVEVYDPVSDEWTLIDRMPEPRFGMGVVNFEGLIYVVGGCTHTWRHTRDLLCYHPASRKWRPLAPMRHARSQAAAVVLGAHLYVIGGNAPRRTVLSSVERYSFDDDSWEEVGSLVEARAGCAAGAADGLLVAAGGDSECGGKRDFYRARTTLASVEIYDPTRDTWTSTTSLPHSRAEAGAALL-