Monarch geneset OGS2.0

DPOGS200606
TranscriptDPOGS200606-TA981 bp
ProteinDPOGS200606-PA326 aa
Genomic positionDPSCF300076 - 301967-306354
RNAseq coverage396x (Rank: top 30%)
Annotation
HeliconiusHMEL0061704e-6641.69% 
BombyxBGIBMGA011294-TA4e-10890.05% 
DrosophilaCG6812-PA2e-9856.07% 
EBI UniRef50UniRef50_C1BPS36e-8650.78%Sideroflexin-2 n=1 Tax=Caligus rogercresseyi RepID=C1BPS3_9MAXI
NCBI RefSeqXP_308642.42e-11562.31%AGAP007119-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|1582862444e-11462.31%AGAP007119-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|1565380547e-11763.91%PREDICTED: sideroflexin-2-like [Nasonia vitripennis]
Group
Gene OntologyGO:00160206.1e-196membrane
GO:00550856.1e-196transmembrane transport
GO:00068126.1e-196cation transport
GO:00083246.1e-196cation transmembrane transporter activity
KEGG pathwaysmm:Smp_1369602e-77 
 K03351 (APC4)maps-> Ubiquitin mediated proteolysis
    Meiosis - yeast
    Cell cycle - yeast
    Progesterone-mediated oocyte maturation
    Cell cycle
    Oocyte meiosis
InterPro domain[5-326] IPR0046866.1e-196Tricarboxylate/iron carrier
Orthology groupMCL14020 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200606-TA
ATGACAGAGAAAAGAATTGATATAACAGAACCCCTGTGGGATCAGAGCACCTTTGTCGGAAGATTCCGGCACTTTGCTTTCATATCAAACCCATTATTGTCTATGGCATCGGAAAAAGAACTTTATGAAGCCAAGGAATTATATTTTAAATATAAGGAAGGTAAGGAACCTCCGAATACCCAGTTGACTCAAGTTGTGAGAGCGAAGCAGCTATATGAATCGGCTTTCCACCCTGACAGCGGTGAACTACAGAATGTCTTCGGCAGGATGTCGTTTCAAATGCCCGGAGGTTGTTTAATAACTGGGGCTATGCTGCAATGGTATAGAACAGCAACCGCGGTAGTGTTCTGGCAGTGGGTGAACCAGTCGTTCAACGCTCTGGTCAACTACACGAACAGAAACGCCAACTCTCCCCTATCAACGACACAGATGGGTGTAGCTTACATCTCAGCGACATCGGCCGCTATGGCCACGGCACTAACATTCAAATACGGCATACAGAAACGTGCCAAGAATCCAATACTCGCTAGATTCGTGCCATTCGCTGCGGTCGCAGCAGCCAATTGGGTCAATATACCCTTAATGAGACAAAATGAAATAGTTTTAGGGTTGGATGTGACCGATGAAAACGGTAAAATAATTGGAAAATCCCAAATAGCCCCCGTCAAGGGAATATCACAAGTTGTAACTTCAAGGATAATAATGTGCGCGCCCGGCATGCTGTTACTGCCTGTTATAATGGAGAAAATAGAACCCAAAGCCTGGATGCAGAGAATTAAGTGGGCTCACATCGGCATACAAACAGGAATCGTTGGAATGTTCCTAACTTTCATGGTGCCAACCGCCTGTGCAATATTCCCACAGAAATGTAAACTCTCAATTGATACAATAAAGCGTTTTGAAAAGGACAGATATGAAGAAATTTTGAAGAACACAGACGGCAAGCCGCCAGAATACGTTTATTTCAACAAAGGCCTTTAG

Protein sequence:

>DPOGS200606-PA
MTEKRIDITEPLWDQSTFVGRFRHFAFISNPLLSMASEKELYEAKELYFKYKEGKEPPNTQLTQVVRAKQLYESAFHPDSGELQNVFGRMSFQMPGGCLITGAMLQWYRTATAVVFWQWVNQSFNALVNYTNRNANSPLSTTQMGVAYISATSAAMATALTFKYGIQKRAKNPILARFVPFAAVAAANWVNIPLMRQNEIVLGLDVTDENGKIIGKSQIAPVKGISQVVTSRIIMCAPGMLLLPVIMEKIEPKAWMQRIKWAHIGIQTGIVGMFLTFMVPTACAIFPQKCKLSIDTIKRFEKDRYEEILKNTDGKPPEYVYFNKGL-