User Tools


Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
Next revision
Previous revision
lunchtime_byte_2023 [2022/12/21 12:33]
maly restructuralised
lunchtime_byte_2023 [2023/01/05 12:39] (current)
maly cc* English
Line 1: Line 1:
-====== PAIREF in CCP4 - lunchtime byte ======+====== PAIREF in CCP4 - CCP4SW Lunchtime Byte ======
  
-Jan 2023+5th Jan 2023
  
-//PAIREF// **depends** on the installation of the [[http://www.ccp4.ac.uk/|CCP4 Software Suite]] or [[https://www.phenix-online.org/|PHENIX]]+[[https://pairef.fjfi.cvut.cz/docs/​pairef_poli_ccp4sw2023/​20230105_ccp4sw_pairef.pdf|{{pdf_icon.png?​30x35}}]] [[https://pairef.fjfi.cvut.cz/docs/​pairef_poli_ccp4sw2023/​20230105_ccp4sw_pairef.pdf|Download presentation in PDF]]
  
-In this short tutorial, we submit a paired refinement job using //PAIREF// program, now distributed in the [[http://​www.ccp4.ac.uk/​|CCP4 suite]].+In this short tutorial, we submit a paired refinement job using the //PAIREF// program, now distributed in the [[http://​www.ccp4.ac.uk/​|CCP4 suite]]. You can use our example data from interferon gamma from //​Paralichthys olivaceus// (POLI) ​ (([[https://​www.sciencedirect.com/​science/​article/​pii/​S1050464818302651|Zahradnik et al. (2018). Fish Shellfish Immunol. 79:​140–152]])),​ PDB entry [[https://​www.rcsb.org/​structure/​6F1E|6F1E]].
  
-==== Download example data ==== 
  
-In this tutorial, we will deal with the data from interferon gamma from //​Paralichthys olivaceus// (POLI) ​ (([[https://​www.sciencedirect.com/​science/​article/​pii/​S1050464818302651|Zahradnik et al. (2018). Fish Shellfish Immunol. 79:​140–152]])),​ PDB entry [[https://​www.rcsb.org/​structure/​6F1E|6F1E]].+==== Download example ​data ====
  
-[[https://​pairef.fjfi.cvut.cz/​docs/​pairef_poli_webinar/​PAIREF_webinar_poli.zip|Download the archive with the data]] and extract it in a working folder. The archive contains:+[[https://​pairef.fjfi.cvut.cz/​docs/​pairef_poli_webinar/​PAIREF_webinar_poli.zip|Download the archive with the POLI  ​data]] and extract it in a folder - we will refer this folder as //working folder//. The archive contains:
   * Structure model previously refined against data at 2.3 Å - //​poli67_webinar_2-3A.pdb//​.   * Structure model previously refined against data at 2.3 Å - //​poli67_webinar_2-3A.pdb//​.
   * Merged diffraction data up to 1.9 Å - //​poli67_1-9A.mtz//​   * Merged diffraction data up to 1.9 Å - //​poli67_1-9A.mtz//​
   * Unmerged diffraction data up to 1.9 Å - //​poli67_XDS_ASCII_1-9A.HKL//​   * Unmerged diffraction data up to 1.9 Å - //​poli67_XDS_ASCII_1-9A.HKL//​
  
-**If you have installed //PAIREF// and extract the data, you are ready for the webinar.** :-) 
  
-==== Open console in working directory ====+==== Launch PAIREF from console in working directory ==== 
 + 
 +Implementation of //PAIREF// to //CCP4i2// interface is under development but it is not available in the latest CCP4 version 8.0.007 yet. Thus, we will use the standalone [[https://​pairef.fjfi.cvut.cz/​docs/​gui.html|graphical user interface (GUI)]] for //PAIREF// that can be launched from command line.
  
 In Windows, find the CCP4 console in the Start menu and open it (see the screenshots below). In GNU/Linux or macOS, just open the terminal, assuming all the executables for CCP4 are available there. In Windows, find the CCP4 console in the Start menu and open it (see the screenshots below). In GNU/Linux or macOS, just open the terminal, assuming all the executables for CCP4 are available there.
Line 24: Line 24:
 {{:​ccp4console.gif?​nolink|}} ​ {{:​ccp4console.gif?​nolink|}} ​
  
-Then go to the folder where your structure model and diffraction data are saved using the command cd. For example, if you saved those three files in folder ''​C:/​Users/​Lab/​PAIREF_tutorial_poli'',​ write ''​cd C:/​Users/​Lab/​PAIREF_tutorial_poli''​ into the console and press ''​Enter''​.+Then go to the folder where your structure model and diffraction data are saved using the command ​''​cd''​. For example, if you saved those three files in the folder ''​C:/​Users/​Lab/​PAIREF_tutorial_poli'',​ write ''​cd C:/​Users/​Lab/​PAIREF_tutorial_poli''​ into the console and press ''​Enter''​.
  
-==== Run PAIREF in command ​line ==== +Now execute the following ​command for launching the PAIREF GUI:
-We will add three high-resolution shells step by step: 2.3-2.10 Å, 2.1-2.0 Å and 2.0-1.9 Å. Run //PAIREF//: <​code>​ccp4-python -m pairef --XYZIN poli67_webinar_2-3A.pdb --HKLIN poli67_1-9A.mtz -u poli67_XDS_ASCII_1-9A.HKL -r 2.1,2.0,1.9 -w 0.06 -p poli_webinar</​code>​ +
-Note: The structure model was refined using X-ray weight 0.06, we have to keep this setting to gain unbiased results. +
- +
-==== Run PAIREF using its graphical user interface ==== +
-//PAIREF// provides a standalone [[https://​pairef.fjfi.cvut.cz/​docs/​gui.html|graphical user interface (GUI)]] ​for intuitive setting of arguments without making up a command. For launching the PAIREF GUI, execute the following command:+
 <​code>​ccp4-python -m pairef --gui</​code>​ <​code>​ccp4-python -m pairef --gui</​code>​
 +
 +The structure of POLI was originally refined at 2.3 Å resolution. Nevertheless,​ we will inspect an impact of the reflection beyond this high-resolution cutoff on the model quality, We have data processed up to 1.9 Å resolution. Thus, we will add three high-resolution shells step by step: 2.3-2.1 Å, 2.1-2.0 Å and 2.0-1.9 Å.
 +
 +In the //PAIREF// window, we need to specify:
 +  * How we want to add high-resolution shells - put 2.1,2.0,1.9 to "​Explicit definition of high-resolution shells"​
 +  * Input structure model
 +  * Input merged diffraction data that contains free flags
 +  * Input unmerged data - //​recommended,​ not required//
 +  * X-ray weight term 0.06 - //from prior knowledge for this particular data set//
 +
 {{ :​pairef_webinal_poli_gui_win.png?​nolink |}} {{ :​pairef_webinal_poli_gui_win.png?​nolink |}}
  
 +And now we can press RUN and open the HTML log file in a web browser.
  
-==== Interpretation of results ​==== +==== Results ​====
-Follow the note ''​%%------>​ RESULTS AND THE CURRENT STATUS OF CALCULATIONS ARE LISTED IN A HTML LOG FILE%%''​ in the program output and open the stated file in your preferred web browser (//e.g.// Firefox). The results should look similar to ours: [[https://​pairef.fjfi.cvut.cz/​docs/​pairef_poli_webinar/​PAIREF_poli_webinar.html]].+
  
-The first thing that should ​be checked is whether the refinements have convergedScroll at the very bottom of page, here you can see plots of Rwork and Rfree vsrefinement cycleWe can conclude that all the refinements have converged.+The results ​should ​look similar to ours: [[https://​pairef.fjfi.cvut.cz/​docs/​pairef_poli_ccp4sw2023/​PAIREF_poli_ccp4sw2023.html]].
  
-Rfree decreased after the addition ​of shells 2.3-2.10 Å and 2.1-2.0 Å:+//PAIREF// ran all the calculation and did also an automatic suggestion ​of an optimal high-resolution cut offLet's check the table on top of the HTML log file:
  
-{{ :poli_webinar_r-values.png?nolink |}}+{{ :poli_ccp4sw2023_verdict_table.png?nolink |}}
  
-Since a perfect model gives an R-value of 0.42 against random data (i.e. pure noise) – assuming non-tNCS (translational non-crystallographic symmetry) data from a non-twinned crystal ((Evans, P. R. & Murshudov, G. N. (2013). Acta Cryst. D69, 1204-1214.)) – a higher R-value ​in the (current) high-resolution shell indicates either ​the involvement of high-resolution ​data without information content (the data are even worse than noise), or poor quality of the model, or the presence of tNCS. This is indicated for the shell  2.0-1.9 Å. +The suggestion is based on the results plotted ​in the following graphs. Overall Rfree decreased for all the three high-resolution ​shells that denotes ​model improvement:
-{{ :poli_webinar_rfree.png?​nolink |}}+
  
-CC* a model-independent measure of noise is in the diffraction data. CC* is higher than CCwork in whole resolution range (except the shell 2.0-1.9 Å where CC* is undefined due to negative CC1/2. Thus, the overfitting was not indicated. To access overfitting,​ it is not needed to test set, so the comparison of CC* with CCwork is much better then with CCfree as CCwork is calculated on more data. +{{ :poli_ccp4sw2023_r-values.png?nolink |}}
-{{ :poli_webinar_ccwork.png?nolink |}}+
  
-**We can conclude that the high-resolution ​cutoff can be set to 2.Å.**+However, we should take into account multiple criteria. Since a perfect model gives an R-value of 0.42 against random data (i.e. pure noise) – assuming non-tNCS (translational non-crystallographic symmetry) data from a non-twinned crystal ((Evans, P. R. & Murshudov, G. N. (2013). Acta Cryst. D69, 1204-1214.)) – a higher R-value in the (current) high-resolution shell indicates either the involvement of high-resolution data without information content (the data are even worse than noise), or poor quality of the model, or the presence of tNCS. This is indicated for the shell  2.0-1.9 Å. 
 +{{ :​poli_ccp4sw2023_rfree.png?​nolink |}} 
 + 
 +CC* is a model-independent measure of noise is in the diffraction data. For this data set, CC* is higher than CCwork in the whole resolution range, except the shell 2.0-1.9 Å where CC* is undefined due to negative CC1/2. That means overfitting was not indicated but the shell 2.0-1.9 Å should be discarded because these data are very noisy. Note that to access overfitting,​ it is not needed to use test set, so the comparison of CC* with CCwork is much better than with CCfree as CCwork is calculated on more data. 
 +{{ :​poli_ccp4sw2023_ccwork.png?​nolink |}} 
 + 
 +We should also check whether the refinements have converged. Scroll at the very bottom of page, here you can see plots of Rwork and Rfree vs. refinement cycle. We can conclude that all the refinements have converged, indeed. 
 + 
 +**We can conclude that the high-resolution ​limit of the data is ca. 2.Å.**
  
 Merging statistics: Merging statistics:
Line 64: Line 75:
 07      2.10   ​2.00 ​ 49772   ​4021 ​  ​12.38 ​ 98.51       ​0.3 ​    ​0.1 ​  ​16.989 ​  ​17.721 ​   4.963   ​0.027 ​  ​0.008 ​  ​0.2293 07      2.10   ​2.00 ​ 49772   ​4021 ​  ​12.38 ​ 98.51       ​0.3 ​    ​0.1 ​  ​16.989 ​  ​17.721 ​   4.963   ​0.027 ​  ​0.008 ​  ​0.2293
 08      2.00   ​1.90 ​ 35920   ​4046 ​   8.88  81.18       ​0.1 ​    ​0.0 ​  ​41.435 ​  ​43.993 ​  ​14.417 ​ -0.132 ​ -0.016 ​  ​N/​A</​code>​ 08      2.00   ​1.90 ​ 35920   ​4046 ​   8.88  81.18       ​0.1 ​    ​0.0 ​  ​41.435 ​  ​43.993 ​  ​14.417 ​ -0.132 ​ -0.016 ​  ​N/​A</​code>​
 +
 +==== Stay tuned, CCP4i2 interface for PAIREF is coming soon! ====
 +
 +{{ :​pairef_ccp4i2_dev.png?​nolink |}}
 +
 +==== Run PAIREF in command line ====
 +
 +It is also possible to run //PAIREF// in the command line. The job described here could be executed using the following command: <​code>​ccp4-python -m pairef --XYZIN poli67_webinar_2-3A.pdb --HKLIN poli67_1-9A.mtz -u poli67_XDS_ASCII_1-9A.HKL -r 2.1,2.0,1.9 -w 0.06 -p poli</​code>​
 +
  
 ===== Contact ===== ===== Contact =====