Middle+ Data Scientist LLM+NLP

Прямой работодатель  WaveAccess ( waveaccess.ru )
Тбилиси, Грузия
Миддл • Сеньор
Аналитика, Data Science, Big Data • Data scientist • Data Science • Python • SQL • Заказная разработка • Natural Language Processing (NLP)
19 июня
Удаленная работа • Работа в офисе
Опыт работы от 3 до 5 лет
Работодатель  WaveAccess
Описание вакансии

WaveAccess is looking for a Data Scientist to join our team and contribute to innovative projects in the pharmaceutical domain. This role involves working with real-world pharmaceutical data and leveraging the power of Large Language Models (LLMs) to drive impactful insights and solutions.

Responsibilities:

  • LLM Integration: Develop, fine-tune, and implement Large Language Models to analyze and process diverse sets of text and medical data.
  • Data Analysis: Perform advanced data analysis on real-world pharmaceutical datasets to extract meaningful insights and support decision-making processes.
  • Text Mining and NLP: Utilize natural language processing techniques to extract relevant information from large volumes of text, including medical literature, patient records, and clinical trial data.
  • Model Development: Build and validate predictive models to address key challenges in the pharmaceutical industry, such as drug efficacy, patient outcomes, and adverse event prediction.
  • Innovation: Stay up-to-date with the latest advancements in LLMs and NLP, and apply innovative approaches to solve complex problems in the pharmaceutical field.

Requirements:

  • At least 4 years of experience in a Data Scientist position
  • English - B2
  • Deep knowledge of Neural Networks and architectures for working with sequences, in particular (RNN, LSTM, Transformers, CNN, attention).
  • Experience with Large Language Models (LLMs) and their application. Familiarity with modern LLM techniques such as Retrieval-Augmented Generation (RAG) and LLM agents.
  • Familiarity with Langchain, Llamaindex concepts and vector DBs
  • Solid Python skills
  • Experience in presenting achieved results

Technologies:

  • Python
  • Transformers
  • LLM models (GPT, LLama, mixtral, etc.)
  • ollama/vllm
  • Standard NLP stack
  • Basic SQL
  • Git
  • Vector databases(Postgres+pgvector / Milvus/ Qdrant/ Faiss)

Preferred:

  • huggingface
  • open llm
  • Knowledge of general Machine Learning approaches
  • Knowledge of mathematical statistics.
  • Experience with AWS (EC2, S3)
  • Linux + bash, ssh
  • Experience in written and verbal communication with business stakeholders
  • Experience with full development cycle

Nice to have:

  • RestAPI development experience
  • Truefoundry
  • Haystack
  • DataIku
  • H2O
  • Snowflake
  • Docker
  • Understanding of CI/CD
  • Java/C++/Other languages

We offer the following conditions:

  • Employment according to the Labor Code, 100% payment of sick leave and vacation
  • Voluntary medical insurance (VMI) including dental coverage
  • Work using flexible development methodology (Agile/Scrum)
  • Flexible start of the working day
  • Weekly seminars, participation in conferences and meetups, and paid certification exams.

Загрузка формы отклика...