NAME Parse::Path - Parser for paths SYNOPSIS use v5.10; use Parse::Path; my $path = Parse::Path->new( path => 'gophers[0].food.count', style => 'DZIL', # default ); my $step = $path->shift; # { key => 'count', ... } say $path->as_string; $path->push($path, '[2]'); foreach my $p (@$path) { say sprintf('%-6s %s --> %s', @$p{qw(type step key)}); } DESCRIPTION Parse::Path is, well, a parser for paths. File paths, object paths, URLs... A path is whatever string that can be translated into hash/array keys. Unlike modules like File::Spec or File::Basename, which are designed for interacting with file systems paths in a portable manner, Parse::Path is designed for interacting with *any* path, filesystem or otherwise, at the lowest level possible. Paths are split out into steps. Internally, these are stored as "step hashes". However, there is some exposure to these hashes as both input and output, so we'll describe them here: { type => 'HASH', # must be either HASH or ARRAY key => 'foo bar', # as it would be represented as a key step => '"foo bar"', # as it would be represented in a path pos => 'X+1', # used to determine depth } For the purposes of this manual, a "step" is usually referring to a step hash, unless specified. CONSTRUCTOR my $path = Parse::Path->new( path => $path, # required style => 'DZIL', # default ); Creates a new path object. Parse::Path is really just a dispatcher to other Parse::Path modules, but it serves as a common API for all of them. Accepts the following arguments: path path => 'gophers[0].food.count' String used to create path. Can also be another Parse::Path object, a step, an array of step hashes, an array of paths, or whatever makes sense. This parameter is required. style style => 'File::Unix' style => '=MyApp::Parse::Path::Foobar' Class used to create the new path object. With a "=" prefix, it will use that as the full class. Otherwise, the class will be intepreted as "Parse::Path::$class". Default is DZIL. auto_normalize auto_normalize => 1 my $will_normalize = $path->auto_normalize; $path->auto_normalize(1); If on, calls "normalize" after any new step has been added (ie: new, "unshift", "push", "splice"). Default is off. This attribute is read-write. auto_cleanup auto_cleanup => 1 my $will_cleanup = $path->auto_cleanup; $path->auto_cleanup(1); If on, calls "cleanup" after any new step has been added (ie: new, "unshift", "push", "splice"). Default is off. This attribute is read-write. METHODS step_count my $count = $path->step_count; Returns the number of steps in the path. Unlike "depth", negative-seeking steps (like ".." for most file-based paths) will not lower the step count. depth my $depth = $path->depth; Returns path depth. In most cases, this is the number of steps to the path, a la "step_count". However, relative paths might make this lower, or even negative. For example: my $path = Parse::Path->new( path => '../../../foo/bar.txt', path_style => 'File::Unix', ); say $path->step_count; # 5 say $path->depth; # -1 Despite the similarity to the pos value of a step hash, this method doesn't tell you whether it's relative or absolute. Use "is_absolute" for that. is_absolute my $is_absolute = $path->is_absolute; Returns a true value if this path is absolute. Hint: most paths are relative. For example, if the following paths were File::Unix paths: foo/bar.txt # relative ../bar.txt # relative bar.txt # relative /home/foo/bar.txt # absolute /home/../bar.txt # absolute (even prior to cleanup) as_string my $path_str = $path->as_string; Returns the string form of the path. This involves taking the individual step strings of the path and placing the delimiters in the right place. as_array my $step_hashes = $path->as_array; Returns the full path as an arrayref of step hashes. The steps are cloned for integrity. If you want a simplier representation of the path, consider "as_string". shift my $step_hash = $path->shift; Works just like the Perl version. Removes a step from the beginning of the path and returns it. The step is cloned for integrity. pop my $step_hash = $path->pop; Works just like the Perl version. Removes a step from the end of the path and returns it. The step is cloned for integrity. unshift my $count = $path->unshift($step_or_path); Works just like the Perl version. Adds a step (or other path-like thingy) to the beginning of the path and returns the number of new steps prepended. Will also call "cleanup" afterwards, if "auto_cleanup" is enabled. push my $count = $path->push($step_or_path); Works just like the Perl version. Adds a step (or other path-like thingy) to the end of the path and returns the number of new steps appended. Will also call "cleanup" afterwards, if "auto_cleanup" is enabled. splice my @step_hashes = $path->splice($offset, $length, $step_or_path); my @step_hashes = $path->splice($offset, $length); my @step_hashes = $path->splice($offset); my $last_step_hash = $path->splice($offset); Works just like the Perl version. Removes elements designated by the offset and length, and replaces them with the new step/path. The steps are cloned for integrity. Returns the steps removed in list context, or the last step removed in scalar context. Will also call "cleanup" afterwards, if "auto_cleanup" is enabled. clear $path->clear; Clears out the path. Returns itself for chaining. replace $path->replace; Replaces the path with a new one. Basically just sugar for "clear" + "push". Unlike the argument form of "clone", this retains the same object and just replaces the internal path. Returns the number of new steps created. clone my $same_path = $path->clone; my $similar_path = $path->clone($new_path); Clones the path object and returns it. Optionally takes another path (object or string or whatever) and puts that path into the clone. This is handy if you want to use the same options and class, but just want a different path. normalize $path->normalize; Normalizes the steps in the path. This ensures that the keys of the step hash and the steps will be the same thing. Or to put it another way, this will make a "round trip" of string-to-path-to-string work commutatively. For example, if the following paths were DZIL paths: '"Oh, but it can..." said the spider'.[0].value # Before normalize "\"Oh, but it can...\" said the spider"[0].value # After normalize a.b...c[0].""."".'' # Before normalize a.b.""."".c[0]."".""."" # After normalize Returns itself for chaining. cleanup $path->cleanup; Cleans up the path. Think of this in terms of "cleanup" within Path::Class. This will remove unnecessary relative steps, and try as best as possible to present an absolute path, or at least one that progresses in a sequential manner. For example, if the following paths were File::Unix paths: /foo/baz/../foo.txt # /foo/foo.txt /foo//baz/./foo.txt # /foo/baz/foo.txt ../../foo/../bar.txt # ../../bar.txt ./command # command Returns itself for chaining. UTILITY METHODS These step conversion methods are available to use, but are somewhat internal, so they might be subject to change. In most cases, you can use the more public methods to achieve the same goals. key2hash my $step_hash = $path->key2hash($key, $type, $pos); my $step_hash = $path->key2hash($key, $type); Figures out the missing pieces of a key/type pair, and returns a complete four-key step hash. The "normalize" method works by throwing away the existing step and using this method. Since pos translation works by using both step+delimiter, and "key2hash" doesn't have access to the delimiter, it's more accurate to pass the pos value than leave it out. path_str2array my $path_array = $path->path_str2array($path_str); Converts a path string into a path array (of step hashes). shift_path_str my $step_hash = $self->shift_path_str(\$path_str); Removes a step from the beginning of the path string, and returns a complete four-key step hash. This is the workhorse for most of Parse::Path's use cases. blueprint my $data = $self->blueprint->{$blueprint_key}; Provides access to the blueprint for parsing the path style. More informaton about what this hashref contains in the role documentation. Cloned for sanity. Create your own Path class if you need to change the specs. OVERLOADS In addition to its standard methods, Parse::Path also has several overloads that are useful: String Concatenation (.=) $path .= 'q.r.s[1]'; $path .= [qw( q r s[1] )]; Modifies the path by calling "push" on the RHS thing. Numeric Comparisons $pathA < $pathB $pathA <= $pathB $pathB > $pathA $pathB >= $pathA $pathA == $pathA $pathA != $pathB $pathA <=> $pathB Uses "depth" for the numeric comparison. Still works in cases of a non-path on one side. String Comparisons $pathA lt $pathB $pathA le $pathB $pathB gt $pathA $pathB ge $pathA $pathA eq $pathA $pathA ne $pathB $pathA cmp $pathB If both sides are P:P objects, each key of the path is compared separately until a difference is found. This effectively bypasses delimiters as an obstacle for path comparisons. If a step is found to be an ARRAY type on both sides, a numeric comparison ("<=>") is done. Mismatched step types are allowed (and checked with "cmp"), so sanity check your paths if this isn't desired. If either side is a non-path, this will fallback to a simple path string comparison. Other overloads !$path # !$path->step_count (ie: does the path contain anything?) "$path" # $path->as_string 0+$path # $path->depth $$path # $path->as_string @$path # @{ $path->as_array } These all work pretty much as you'd expect them to. CONVERSION Different path styles can be used with ease. Convert Unix paths to Window paths? No problem: my $unix_path = Parse::Path->new( path => '/root/tmp/file.txt', style => 'File::Unix' ); my $win_path = Parse::Path->new( path => $unix_path, style => 'File::Win32', ); $win_path->as_string; # \root\tmp\file.txt $win_path->volume('C'); $win_path->as_string; # C:\root\tmp\file.txt $win_path->splice(-1, 1, '..\foobar.gif'); $win_path->cleanup->as_string; # C:\root\foobar.gif $unix_path->replace($win_path); $unix_path->as_string; # /root/foobar.gif CAVEATS Absolute paths and step removal Steps can be removed from the path as needed, but keep in mind that "cleanup" doesn't get called methods like "shift", even if "auto_cleanup" is set. This doesn't make a difference on absolute paths as the depth they are given is permanent. Appending two absolute paths may end up cancelling each other out: my $path = Parse::Path->new( path => '/root/tmp/file.txt', style => 'File::Unix', auto_cleanup => 1, ); $path->shift; # remove the blank root $path->shift; # now a dangling 'tmp/file.txt', tied to position 2 $path->unshift('/home/bbyrd'); $path->as_string; # /home/tmp/file.txt This problem can be sidestepped by using the string forms: $path->shift; $path->shift; # tmp/file.txt $path->replace( [ '/home/bbyrd', $path->as_string ] ); $path->as_string; # /home/bbyrd/tmp/file.txt This may be fixed in a later release. Normalization of splits While "auto_normalize" controls normalization of steps, delimiter normalization is still automatic. For example: my $path = Parse::Path->new( path => 'foo//////bar.txt', style => 'File::Unix', ); say $path->as_string; # foo/bar.txt This is because delimiters are not actually stored anywhere after parsing. The "as_string" method takes the hash steps and re-adds the delimiters, per rules on the blueprint of the path class. (See "delimiter_placement" in Parse::Path::Role::Path.) Sparse arrays and memory usage Since arrays within paths are based on indexes, there's a potential security issue with large indexes causing abnormal memory usage with certain modules that would use these paths. In Perl, these two arrays would have drastically different memory footprints: my @small; $small[0] = 1; my @large; $large[999999] = 1; This can be mitigated by making sure the Path style you use will limit the total digits for array indexes. Parse::Path handles this on all of its paths, but it's something to be aware of if you create your own path classes. SEE ALSO Data::SplitSerializer - Uses this module for path parsing AVAILABILITY The project homepage is . The latest version of this module is available from the Comprehensive Perl Archive Network (CPAN). Visit to find a CPAN site near you, or see . SUPPORT Internet Relay Chat You can get live help by using IRC ( Internet Relay Chat ). If you don't know what IRC is, please read this excellent guide: . Please be courteous and patient when talking to us, as we might be busy or sleeping! You can join those networks/channels and get help: * irc.perl.org You can connect to the server at 'irc.perl.org' and talk to this person for help: SineSwiper. Bugs / Feature Requests Please report any bugs or feature requests via . AUTHOR Brendan Byrd COPYRIGHT AND LICENSE This software is Copyright (c) 2013 by Brendan Byrd. This is free software, licensed under: The Artistic License 2.0 (GPL Compatible)