Monarch geneset OGS2.0

DPOGS215533
TranscriptDPOGS215533-TA945 bp
ProteinDPOGS215533-PA314 aa
Genomic positionDPSCF300129 - 712690-722849
RNAseq coverage152x (Rank: top 53%)
Annotation
HeliconiusHMEL0061702e-16187.17% 
BombyxBGIBMGA010701-TA3e-15076.58% 
DrosophilaCG11739-PD1e-10154.09% 
EBI UniRef50UniRef50_Q9VN131e-9954.09%AT24389p n=30 Tax=Coelomata RepID=Q9VN13_DROME
NCBI RefSeqXP_313819.47e-10556.92%AGAP004519-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|1582922951e-10356.92%AGAP004519-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|3800244994e-10458.23%PREDICTED: sideroflexin-1-like [Apis florea]
Group
Gene OntologyGO:00160202.9e-199membrane
GO:00550852.9e-199transmembrane transport
GO:00068122.9e-199cation transport
GO:00083242.9e-199cation transmembrane transporter activity
KEGG pathwaysmm:Smp_1369602e-87 
 K03351 (APC4)maps-> Ubiquitin mediated proteolysis
    Meiosis - yeast
    Cell cycle - yeast
    Progesterone-mediated oocyte maturation
    Cell cycle
    Oocyte meiosis
InterPro domain[2-314] IPR0046862.9e-199Tricarboxylate/iron carrier
Orthology groupMCL12528 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215533-TA
ATGAGTCACATAGATTTGGACAAGCCTCGTTATGATCAGAGTACGTACATGGGTAGGGCGAAACATTTCCTTCTGCTAACAAATCCCATGAACGTTTTCGCAAGCAACAAAGATCTTGAGGATGCTAGGAAGATTGTCACTGAGTTCAGGAAGAGTCGTAGGATGCCAGCCGGCTACGACGAGGACAAATTGTGGGCTACCAAGTATTTGTATGATAGCGCCTTCCACCCGGACACGGGGGAGAAGATGATCGCCATCGGACGGATGTCGGCTCAGGCTCCAATGAACACAATCATCACAGGATGCATGATAACCTTTTATAAGACAACAGCGGCGACCGTCTTTTGGCAGTGGGTCAATCAGACGTTTAATGCCCTCGTAAACTATACCAACAGAGGGGGCGACGCTCCCTTACCGACATCCCAGTTATTGGCGTCGTATTGTGCCGCGTGTGGTGGGGCGTTGTCTACTGCACTTTTTCTCAACAGCAAGGTTAAGAACCTGCCACCGATTTATGCTTCCCTGGTACCTTTCGCCGCGGTTTGCGGCGCTAACTTCATAAACATACCTATTGAACTGTTGAATGGTACGCCGGTGTTCACGGCGGACGGTACTCGCATCGGAAACTCTAAGAGGGCTGCCAAATATGGAATCGGACTCGTCTGCATCTCGAGGGTCTTGATGGCTTTACCAGGAATGACGTTGACTCCTATCATAACGAACATAGCCACGCGGCGCGGCTTGTTCTGCCGGCGACCCATGATGGTGATACCATTCCAGCTGTTTCTGGTCGGTCTGTGCGTCACCTTCGCCACGCCGCTGTGCTGCGCCATCTTCGAGCAGAAGGCGTCCATATCGGTGGACAACCTGGATCCAGAGCTCAGGGATAGTGTCAGGAAGAACTACCCAAAAATAAAAGAAGTCTACTTCAATAAGGGTCTATGA

Protein sequence:

>DPOGS215533-PA
MSHIDLDKPRYDQSTYMGRAKHFLLLTNPMNVFASNKDLEDARKIVTEFRKSRRMPAGYDEDKLWATKYLYDSAFHPDTGEKMIAIGRMSAQAPMNTIITGCMITFYKTTAATVFWQWVNQTFNALVNYTNRGGDAPLPTSQLLASYCAACGGALSTALFLNSKVKNLPPIYASLVPFAAVCGANFINIPIELLNGTPVFTADGTRIGNSKRAAKYGIGLVCISRVLMALPGMTLTPIITNIATRRGLFCRRPMMVIPFQLFLVGLCVTFATPLCCAIFEQKASISVDNLDPELRDSVRKNYPKIKEVYFNKGL-