Molecular structure-property relationship by supervised learning
- Input : Structure or protein (X)
- Method
: Convention,
$$ Y = f(X) $$
where f = Schrodinger equation or Hamiltonian
: Modern, feature extraction (L),
$$ Y = f_θ(X)$$
where f = AI or machine learning (DNN, CNN, RNN, GNN, etc), θ = a set of learnable parameters
- Output data : Property (Y), biding affinity.
Modeling (θ) = Maximum likelihood estimate
- Assumption : Random error term follows Gaussian distribution.
θ : ML model
x : observation
y : target distribution
find θ is called machine learning training.
$$ L(θ) = L(θ; x,y) = p(y|x;θ) $$
Machine learnig vs. Deep learning
- Machine learning process (shallow learning)
: Input -> feature extraction (hand-crafted features by experts) -> other ML -> Output
- Deep learning process
: Input -> Deep Network (automated feature extraction by AI) -> Output
- They are different by data size.
Advantages using AI
- Physics vs. machine learning : cost efficiency.
- Inverse design : AI를 통해 원하는 binding affinity를 가지는 구조를 얻을 수 있다. (e.g. hit-to-lead)
Reference
- Why AI for virtual screening l 김우연, https://youtu.be/l2LfNIP82Yw
'Drug' 카테고리의 다른 글
Virtual screening 4 - 3D CNN (0) | 2021.12.12 |
---|---|
Virtual screening 3 - Data sets (0) | 2021.12.11 |
Virtual screening 1 - Intro (0) | 2021.12.10 |
Molecular dynamics software (0) | 2021.09.29 |
신약 개발 과정 (0) | 2021.09.28 |
댓글