Monarch geneset OGS2.0

DPOGS207512
TranscriptDPOGS207512-TA1635 bp
ProteinDPOGS207512-PA544 aa
Genomic positionDPSCF300177 - 143648-148383
RNAseq coverage149x (Rank: top 53%)
Annotation
HeliconiusHMEL0222672e-11764.89% 
BombyxBGIBMGA001936-TA5e-15956.05% 
DrosophilaCG8079-PA2e-1344.74% 
EBI UniRef50UniRef50_E2C5924e-8239.31%Angiogenic factor with G patch and FHA domains 1 n=2 Tax=Formicidae RepID=E2C592_HARSA
NCBI RefSeqXP_394532.33e-8339.12%PREDICTED: similar to CG8079-PA, isoform A [Apis mellifera]
NCBI nr blastpgi|3504111355e-8438.92%PREDICTED: angiogenic factor with G patch and FHA domains 1-like isoform 1 [Bombus impatiens]
NCBI nr blastxgi|3454958272e-9741.51%PREDICTED: angiogenic factor with G patch and FHA domains 1-like [Nasonia vitripennis]
Group
Gene OntologyGO:00055154.4e-25protein binding
GO:00056221e-12intracellular
GO:00036761e-12nucleic acid binding
KEGG pathwayamr:AM1_39673e-06 
 K01768 (E4.6.1.1)maps-> Meiosis - yeast
    Purine metabolism
InterPro domain[313-438] IPR0002534.4e-25Forkhead-associated (FHA) domain
[305-431] IPR0089842.9e-22SMAD/FHA domain
[511-543] IPR0004671e-12D111/G-patch
Orthology groupMCL14284 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207512-TA
ATGATGGAGAAACCAGATGAAGGTTGTACAATAATTAAAAATTCTGGAAATCACCACAAACGTAAAATGTTTAATTTAAGAAAATTGCGGTTGTCTTTAAAGTATAGGCCTAAAGTTTATGAATTAATATTGAAAATGAGGCAATATATACGAAAGAAAAATGTTTTACTCAATAAGTTCAAGTGCTTAATTAAAGAAAAGAAAATAATCGAAACTGAAAATTCACATACACAAACGAGCACAGAAACTATAAAAAAGAAAGTTTCAAATAACAACAAAAATATCTCTATGAAAGATTCATACACTCAAACAATCGCATCTGATGAGAATATATTAAGTAAGGAAAGCAATGAAGCGACTGCCTGGACAGTTGACAATACAACAAGCGAGAAGAGTATAGCCGAACAAGTAAAAGAAGCAGCTCAGAGTGCTCTACAAGATTCGGGGATGGTATTTGTTGAATCAATGGGAATGTACTATGATTATAAGACTGGCTATTATTATAACTCTGAACTCGGTCTCTATTATCACACTGACACAGGCTGCTATTATTATTATTCAGATGAGAAGAAGTCCTTTGTTTTTCACTCATATCCCGACAAAAGTGCTGACAATATCGCATTTGAAGCACACGAAAAGAGGAAAGCTAGGAAGCATAAAAAGGCAGCTAAATCTGATGGTATAGAAAACCTGACGAAACAATTGAATCAGGACAATGAAGAAGGCAGTGAGCCGAAACGTAAGAAGAAAGAAAATGTTAAGCAAAAAGACTTTAAGGATATAGATGGGAACGGGAAAAGTGATGGCTTTGTACAAGATTCTGTGGTGAAAGATGACCTGGAAGATGGTGAATGTAGCGAGAGCTCGGACGATGAGAATGTTGAGTCTGATGCGAGTACGGCCTCTCAGAGTGATGATGACTCGGTGGCCAAACATCATCCTCCGTGTATGCGTATCATAGTGCGAGAGACCAGTCTACAGAAACTGAAAGTCGGGAGTCTGTTTGTGATCACTAAAGATGGCGGCAGCGTCGGCCGCGAGGGAAGTGACCACGCCTTCCTACTGAGAGATCATAACGTATCCCGAAACCATCTGGATATAAAATATGACATGAACAGAAGAATGTACGTCGCTATTGATTTGGGATCCAAGAACGGCACCATACTTAACGGGAACCGGATGTCGGAGAGTCAGACGGTCAGCCAGCCGATGGAGATAGTCCACGGCAGCACTCTGCAGCTCGGCGAGACCAAGCTGTTGTGTCACGTCCACCCTGGCAACGACACGTGCGGTCACTGTGAACCTGGCCTCATCATGGAATATAAAGAGAAGGTGGCGTACACGAGGACCTGCAGCGTTCAGAAACAGTATCAGCTGGAGTTGGCCAGGTTAAAGAACAAGTATGCCCCAACACCACTCGCCATAGAGGAGACGGCTTACAACGATCGAGCCAGAACGAGGCGGGAGACGGTCGGGTCCTCACATCACTCCGAGAAGACGGTGGCCAGTGACGTGCACACATTTATTGCACCAGAAAACAAGGGTTTCAAGCTGTTAGAGAAGATGGGCTGGTCGAAGGGCGAAGGACTCGGCAAGGACTCTCAAGGAGACCAGGAACCGGTGAGAGCTATATAG

Protein sequence:

>DPOGS207512-PA
MMEKPDEGCTIIKNSGNHHKRKMFNLRKLRLSLKYRPKVYELILKMRQYIRKKNVLLNKFKCLIKEKKIIETENSHTQTSTETIKKKVSNNNKNISMKDSYTQTIASDENILSKESNEATAWTVDNTTSEKSIAEQVKEAAQSALQDSGMVFVESMGMYYDYKTGYYYNSELGLYYHTDTGCYYYYSDEKKSFVFHSYPDKSADNIAFEAHEKRKARKHKKAAKSDGIENLTKQLNQDNEEGSEPKRKKKENVKQKDFKDIDGNGKSDGFVQDSVVKDDLEDGECSESSDDENVESDASTASQSDDDSVAKHHPPCMRIIVRETSLQKLKVGSLFVITKDGGSVGREGSDHAFLLRDHNVSRNHLDIKYDMNRRMYVAIDLGSKNGTILNGNRMSESQTVSQPMEIVHGSTLQLGETKLLCHVHPGNDTCGHCEPGLIMEYKEKVAYTRTCSVQKQYQLELARLKNKYAPTPLAIEETAYNDRARTRRETVGSSHHSEKTVASDVHTFIAPENKGFKLLEKMGWSKGEGLGKDSQGDQEPVRAI-