Workflow crashing with file not found due to strange nipype.pipeline.engine.utils.modify_path behavior #2944

stilley2 · 2019-06-13T20:28:48Z

Summary

When getting the node results, the modify_path function will mistakenly identify some outputs as files, causing the run to fail.

Actual behavior

Workflow crashes with a file not found error due to output being mistakenly identified as a file. This occurs when the output is a string (or bytes? untested), and a file with the same name exists in the current working directory.

Expected behavior

modify_path does not identify the object as a file.

How to replicate the behavior

See below

Script/Workflow details

The following script will fail if there is a file named "22" in the directory from which it is called.

from nipype.pipeline import engine as pe
from nipype.interfaces.base import BaseInterfaceInputSpec, TraitedSpec, BaseInterface, traits


class Concat2InputSpec(BaseInterfaceInputSpec):
    x = traits.String()


class Concat2OutputSpec(TraitedSpec):
    y = traits.String()


class Concat2(BaseInterface):
    input_spec = Concat2InputSpec
    output_spec = Concat2OutputSpec

    def _run_interface(self, runtime):
        self._outval = self.inputs.x + '2'
        return runtime

    def _list_outputs(self):
        outputs = self._outputs().get()
        outputs['y'] = self._outval
        return outputs


if __name__ == '__main__':
    node = pe.Node(Concat2(x=2), 'double')
    node.run()

Please put URL to code or code here (if not too long).

Platform details:

{'commit_hash': '7fbc7cbac',
 'commit_source': 'repository',
 'networkx_version': '2.3',
 'nibabel_version': '2.4.1',
 'nipype_version': '1.2.1-dev+g7fbc7cbac',
 'numpy_version': '1.16.4',
 'pkg_path': '/home/stilley2/dev/nipype_debug/nipype/nipype',
 'scipy_version': '1.3.0',
 'sys_executable': '/home/stilley2/dev/nipype_debug/pyenv/bin/python',
 'sys_platform': 'linux',
 'sys_version': '3.7.3 (default, May 11 2019, 00:38:04) \n'
                '[GCC 9.1.1 20190503 (Red Hat 9.1.1-1)]',
 'traits_version': '5.1.1'}

Execution environment

My python environment outside container

The text was updated successfully, but these errors were encountered:

Addresses nipy#2944

effigies · 2019-06-18T23:24:34Z

For added context, running the provided script produces the following error:

190618-19:12:45,52 nipype.workflow INFO:
	 [Node] Setting-up "double" in "/private/var/folders/yb/10x6vxlx40s1dhv5vc163m680000gp/T/tmpwhu6niza/double".
190618-19:12:45,57 nipype.workflow INFO:
	 [Node] Running "double" ("__main__.Concat2")
Traceback (most recent call last):
  File "test.py", line 29, in <module>
    node.run()
  File "/anaconda3/lib/python3.6/site-packages/nipype/pipeline/engine/nodes.py", line 487, in run
    self, report_type='postexec', is_mapnode=isinstance(self, MapNode))
  File "/anaconda3/lib/python3.6/site-packages/nipype/pipeline/engine/utils.py", line 149, in write_report
    result = node.result  # Locally cache result
  File "/anaconda3/lib/python3.6/site-packages/nipype/pipeline/engine/nodes.py", line 197, in result
    return _load_resultfile(self.output_dir(), self.name)[0]
  File "/anaconda3/lib/python3.6/site-packages/nipype/pipeline/engine/utils.py", line 359, in load_resultfile
    basedir=path).items()):
  File "/anaconda3/lib/python3.6/site-packages/nipype/pipeline/engine/utils.py", line 478, in modify_paths
    val, relative=relative, basedir=basedir)
  File "/anaconda3/lib/python3.6/site-packages/nipype/pipeline/engine/utils.py", line 498, in modify_paths
    raise IOError('File %s not found' % out)
OSError: File /private/var/folders/yb/10x6vxlx40s1dhv5vc163m680000gp/T/tmpwhu6niza/double/22 not found

Further, this was introduced in #2325, due to the saving/loading of the resultfile. I suspect the fact that we're losing trait information is the problem. Still trying to work through whether the correct response is the fix proposed in #2945.

effigies · 2019-06-18T23:31:43Z

cc @oesteban you might see the way to go here better, since #2325 was your refactor.

oesteban · 2019-06-19T05:24:07Z

I'm not sure this is a problem of nipype - although I acknowledge there could be potential file name collisions conducive to this problem and we may want to keep track of existing files in the results file.

I'll wait on the original fMRIPrep issue to understand the problem.

stilley2 · 2019-06-19T13:17:04Z

I'm pretty sure this is a nipype issue and not an fmriprep issue. The above example does not rely on fmriprep, and definitely results in undesirable behavior. I agree with @effigies that the main issue is loss of type information, and trying to recover that information, so #2945 is probably not an ideal solution.

oesteban · 2019-06-19T14:59:09Z

I can only imagine two situations where this is an issue:

If an interface has both an string output and a file output, and both values coincide (which is a pretty particular condition, and I'm not sure there is an interface actually doing this).
If a previous run of a workflow is manipulated adding such a file (which is artificial and this is why I'm not convinced this is a nipype issue).

For that reason, I think we should first replicate and identify the error condition in fMRIPrep and then decide whether it is worth tracking file traits.

WDYT @satra?

stilley2 · 2019-06-19T15:12:37Z

The example script I posted doesn't satisfy either criteria, but still causes an issue. To cause this issue all that is required is:

an interface has a string output
A file with a name equal to that string exists in the directory from which the python script was called (i.e., not the workflow's working directory).

oesteban · 2019-06-20T10:07:08Z

Gotcha, I hadn't realized of number 2. Well, I guess then #2945 should address this issue.

effigies · 2019-06-20T12:39:26Z

One question I have is why we're modifying paths in the resultfile.

oesteban · 2019-06-23T00:36:07Z

It is a buggy filter to convert paths to relative (before storing) and then back to absolute (after loading).

effigies · 2019-06-23T13:01:57Z

Sure, but why?

oesteban · 2019-06-23T16:44:37Z

I would imagine (and that was there before my refactor) that it is to make the result file work even if you move the work directory somewhere else. For a single run (no reuse), I agree it doesn't make much sense. @satra may have a better understanding of this.

satra · 2019-06-23T19:40:31Z

@oesteban and @effigies - this should be controlled through a config option:

https://github.com/nipy/nipype/blob/master/nipype/utils/config.py#L65

but it sounds like the option is not being implemented properly. the default is false, which means it should not use relative paths unless forced.

effigies · 2019-06-26T14:04:47Z

I think that's being handled inside modify_paths:

nipype/nipype/pipeline/engine/utils.py

Lines 490 to 496 in 1eeabd3

    
           if relative: 
        
               if config.getboolean('execution', 'use_relative_paths'): 
        
                   out = relpath(object, start=basedir) 
        
               else: 
        
                   out = object 
        
           else: 
        
               out = os.path.abspath(os.path.join(basedir, object))

The problem is that we're identifying paths as strings that exist on the filesystem relative to some directory (either CWD or node-WD), which can sometimes be true for String traits, as well. So avoiding relative paths won't save us when the string output is a valid path; forcing to absolute will still modify the string.

satra · 2019-06-26T14:46:40Z

@effigies - in other situations we check the traits metadata to determine if this is a string vs a file. (hash_files=False). a good improvement would be to separate out a Path object that is a string vs a Path object that is a file.

effigies · 2019-06-26T16:33:30Z

in other situations we check the traits metadata to determine if this is a string vs a file. (hash_files=False).

Yes, but this is operating on a dict, without trait information. I'm working on a strategy to pass in traits as type hints.

a good improvement would be to separate out a Path object that is a string vs a Path object that is a file.

Could you clarify? Are you suggesting to use pathlib.Path as the object that is pickled?

satra · 2019-07-01T14:23:16Z

@effigies - i simply meant altering the traits_extension File class to either return a string or a pathlib object depending on the traits metadata.

effigies · 2019-07-15T01:24:06Z

@effigies - i simply meant altering the traits_extension File class to either return a string or a pathlib object depending on the traits metadata.

I've looked at the traits stuff, and have decided I don't really know how to start. Just noting this, in case somebody else has time to play with traits.

oesteban · 2019-07-15T23:43:07Z

Having a look right this minute

oesteban · 2019-07-16T01:12:10Z

@satra the new trait would return pathlike if exists is True or this new metadata is True, correct?

satra · 2019-07-16T01:15:35Z

@oesteban - the metadata already exists, so the validate function in the File trait would return a Path object if hash_files is not False

satra · 2019-07-16T01:16:29Z

i would start with updating the File/Directory trait in traits_extensiion instead of creating a new trait.

Ref nipy#2944

oesteban · 2019-07-16T01:34:30Z

Sorry, I was very unclear. Please check the draft PR above - I was actually modifying File/Directory.

oesteban · 2019-07-16T01:36:04Z

@satra, the metadata name is nohash, correct?

satra · 2019-07-16T01:55:05Z

the metadata name is hash_files, nohash is when a particular traits item should not be included in the hash computation. while hash_files either hashes the string (when False) or the content/timestamp (when True or missing)

Building a solution to nipy#2944, starting from a refactor of ``aggregate_outputs`` to be robuster and perform the referrencing when requested via the new arguiment ``rebase_cwd``.

Two new methods ``resolve_path_traits`` and ``rebase_path_traits`` are being included. They take trait instances from a spec (selected via ``spec.trait('traitname')``, the value and a base path. These two functions will be usefull to progress towards nipy#2944.

Fixes nipy#2944.

Two new methods ``resolve_path_traits`` and ``rebase_path_traits`` are being included. They take trait instances from a spec (selected via ``spec.trait('traitname')``, the value and a base path. These two functions will be usefull to progress towards nipy#2944.

Fixes nipy#2944.

Two new methods ``resolve_path_traits`` and ``rebase_path_traits`` are being included. They take trait instances from a spec (selected via ``spec.trait('traitname')``, the value and a base path. These two functions will be usefull to progress towards nipy#2944.

Fixes nipy#2944.

Once we figure out the problem of ``OutputMultiObject``, we could go ahead and set fix nipy#2944, fix nipreps/fmriprep#1674, close nipy#2945.

It seems that nipy#2944 has uncovered a rats-nest hidden in the engine. In resolving that issue, I found out that a great deal of boilerplate was set in place when loading/saving results to deal with ``OutputMulti{Object,Path}`` traits. The reason being that these traits flatten single-element-list values. This PR fixes the pickling behavior of traited specs containing these types of traits. Additionally, this PR also avoids the ``modify_paths`` function that was causing problems originally in nipy#2944. Therefore, this PR effectively make results files static, meaning: caching if the ``base_dir`` of the workflow is changed will not work anymore. I plan to re-insert this feature (results file mobility) with nipy#2971. This PR is just to split that one in more digestible bits. All the boilerplate mentioned above has been cleaned up.

Fixes nipy#2944.

Once we figure out the problem of ``OutputMultiObject``, we could go ahead and set fix nipy#2944, fix nipreps/fmriprep#1674, close nipy#2945.

Close nipy#2944. Close nipy#2949.

Fixes nipy#2944.

Once we figure out the problem of ``OutputMultiObject``, we could go ahead and set fix nipy#2944, fix nipreps/fmriprep#1674, close nipy#2945.

Close nipy#2944. Close nipy#2949.

Fixes nipy#2944.

Once we figure out the problem of ``OutputMultiObject``, we could go ahead and set fix nipy#2944, fix nipreps/fmriprep#1674, close nipy#2945.

Close nipy#2944. Close nipy#2949.

stilley2 pushed a commit to stilley2/nipype that referenced this issue Jun 13, 2019

Check that file exists in basedir, not current dir

d837038

Addresses nipy#2944

This was referenced Jun 13, 2019

Using res-2 in output-spaces argument causes failure if a file named "2" exists in the current directory nipreps/fmriprep#1674

Closed

PR: FIX Check that file exists in basedir, not current dir #2945

Closed

effigies mentioned this issue Jun 26, 2019

FIX: Use traits to provide type hints when modifying paths #2949

Closed

3 tasks

oesteban added a commit to oesteban/nipype that referenced this issue Jul 16, 2019

ENH: Modify Directory and File traits to get along with pathlib

8124823

Ref nipy#2944

oesteban mentioned this issue Jul 16, 2019

ENH: Modify Directory and File traits to get along with pathlib #2959

Closed

1 task

This was referenced Jul 16, 2019

ENH: Modify Directory and File traits to get along with pathlib #2962

Merged

MAINT: Various minor improvements to complement previous PR #2964

Merged

oesteban mentioned this issue Jul 18, 2019

FIX: Resolving absolute to relative paths in output #2966

Closed

oesteban mentioned this issue Jul 19, 2019

ENH: Add resolve/rebase BasePath traits methods & tests #2970

Merged

1 task

oesteban added a commit to oesteban/nipype that referenced this issue Jul 19, 2019

enh: add resolving to the results loader and rebasing to saver

331bd21

Fixes nipy#2944.

oesteban mentioned this issue Jul 19, 2019

FIX: Resolving/rebasing paths from/to results files #2971

Merged

1 task

oesteban added a commit to oesteban/nipype that referenced this issue Jul 31, 2019

enh: add resolving to the results loader and rebasing to saver

72e2e96

Fixes nipy#2944.

oesteban added a commit to oesteban/nipype that referenced this issue Aug 1, 2019

enh: add resolving to the results loader and rebasing to saver

fb6ad23

Fixes nipy#2944.

oesteban added a commit to oesteban/nipype that referenced this issue Aug 1, 2019

enh: add resolving to the results loader and rebasing to saver

4f1c49c

Fixes nipy#2944.

oesteban mentioned this issue Aug 1, 2019

FIX: Correctly pickle OuputMulti{Object,Path} traits #2983

Merged

1 task

oesteban added a commit to oesteban/nipype that referenced this issue Aug 1, 2019

enh: add resolving to the results loader and rebasing to saver

07312e0

Fixes nipy#2944.

oesteban added a commit to oesteban/nipype that referenced this issue Aug 1, 2019

fix: final touches to the PR

25cb3af

Close nipy#2944. Close nipy#2949.

oesteban mentioned this issue Aug 2, 2019

FIX: Use load_resultfile when loading a results pickle #2985

Merged

1 task

oesteban added a commit to oesteban/nipype that referenced this issue Aug 2, 2019

enh: add resolving to the results loader and rebasing to saver

9b37ea5

Fixes nipy#2944.

oesteban added a commit to oesteban/nipype that referenced this issue Aug 2, 2019

fix: final touches to the PR

1823e31

Close nipy#2944. Close nipy#2949.

oesteban added a commit to oesteban/nipype that referenced this issue Aug 6, 2019

enh: add resolving to the results loader and rebasing to saver

8292a7a

Fixes nipy#2944.

oesteban added a commit to oesteban/nipype that referenced this issue Aug 6, 2019

fix: final touches to the PR

5f917f2

Close nipy#2944. Close nipy#2949.

oesteban closed this as completed in #2971 Aug 7, 2019

effigies added this to the 1.2.1 milestone Aug 16, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Workflow crashing with file not found due to strange nipype.pipeline.engine.utils.modify_path behavior #2944

Workflow crashing with file not found due to strange nipype.pipeline.engine.utils.modify_path behavior #2944

stilley2 commented Jun 13, 2019

effigies commented Jun 18, 2019

effigies commented Jun 18, 2019

oesteban commented Jun 19, 2019

stilley2 commented Jun 19, 2019

oesteban commented Jun 19, 2019 •

edited

Loading

stilley2 commented Jun 19, 2019

oesteban commented Jun 20, 2019

effigies commented Jun 20, 2019

oesteban commented Jun 23, 2019

effigies commented Jun 23, 2019

oesteban commented Jun 23, 2019

satra commented Jun 23, 2019

effigies commented Jun 26, 2019

satra commented Jun 26, 2019

effigies commented Jun 26, 2019

satra commented Jul 1, 2019

effigies commented Jul 15, 2019

oesteban commented Jul 15, 2019

oesteban commented Jul 16, 2019

satra commented Jul 16, 2019

satra commented Jul 16, 2019

oesteban commented Jul 16, 2019

oesteban commented Jul 16, 2019

satra commented Jul 16, 2019

Workflow crashing with file not found due to strange nipype.pipeline.engine.utils.modify_path behavior #2944

Workflow crashing with file not found due to strange nipype.pipeline.engine.utils.modify_path behavior #2944

Comments

stilley2 commented Jun 13, 2019

Summary

Actual behavior

Expected behavior

How to replicate the behavior

Script/Workflow details

Platform details:

Execution environment

effigies commented Jun 18, 2019

effigies commented Jun 18, 2019

oesteban commented Jun 19, 2019

stilley2 commented Jun 19, 2019

oesteban commented Jun 19, 2019 • edited Loading

stilley2 commented Jun 19, 2019

oesteban commented Jun 20, 2019

effigies commented Jun 20, 2019

oesteban commented Jun 23, 2019

effigies commented Jun 23, 2019

oesteban commented Jun 23, 2019

satra commented Jun 23, 2019

effigies commented Jun 26, 2019

satra commented Jun 26, 2019

effigies commented Jun 26, 2019

satra commented Jul 1, 2019

effigies commented Jul 15, 2019

oesteban commented Jul 15, 2019

oesteban commented Jul 16, 2019

satra commented Jul 16, 2019

satra commented Jul 16, 2019

oesteban commented Jul 16, 2019

oesteban commented Jul 16, 2019

satra commented Jul 16, 2019

oesteban commented Jun 19, 2019 •

edited

Loading