Postdoc in confidential hpc for ai in cancer care (100%, 3 years)
BaselUniversität Basel
...Your work focuses on three areas: Preference optimization and LLM alignment: design preference-based training and fine-tuning methods (RLHF, PPO, DPO, reward modeling) for medical and multilingual LLMs. Agentic and tool-augmented AI systems: develop reasoning and interaction capabilities including RAG, in-context learning, [...]
Kategorie Ingenieurwesen