Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Wildcard in route parameter regexp #2495

Closed
bevacqua opened this issue Jan 12, 2015 · 32 comments
Closed

Wildcard in route parameter regexp #2495

bevacqua opened this issue Jan 12, 2015 · 32 comments
Assignees
Labels

Comments

@bevacqua
Copy link

Route '/:username([a-z][a-z-]*)'
Route '/:username([a-z][a-z-]*)/:project([a-z][a-z0-9-]*)'

URL /someone/project

Expectation is that the second route should match.
Actual result is the first route matches because the * is considered as all the things rather than "zero or more" [a-z-].

@dougwilson
Copy link
Contributor

It's a limitation of the current version of path-to-regexp we use. A work-around would be to use the following route: /:username([a-z][a-z-]{0,}) (i.e. use {0,} in place of *).

@dougwilson
Copy link
Contributor

I'm going to check if the most recent (non-backwards-compatible) version of path-to-regexp handles this better.

@dougwilson
Copy link
Contributor

It does. So this should be fixed in 5.0, but as for 4.x, you'll probably have to just use the old syntax :/

@bevacqua

This comment has been minimized.

@dougwilson

This comment has been minimized.

@charlierudolph
Copy link

So is path-to-regexp not going to be updated until version 5.0? That's something I would really like to see as soon as possible. Also the website is misleading because it points to path-to-regexp and its regexp tester but it is currently very out of date.

@dougwilson
Copy link
Contributor

It's not that we don't want to upgrade path-to-regexp before 5.0, but all versions above where we are at have breaking changes and thus must wait until 5.0. We cannot upgrade path-to-regexp is a 4.x minor release if people's existing 4.x routes break.

As for website issues, please file them at https://github.com/strongloop/expressjs.com

@bevacqua
Copy link
Author

bevacqua commented May 7, 2015

For what it's worth, I've been using {0,} without any inconvenients for a long while now.

@charlierudolph

This comment has been minimized.

@bevacqua

This comment has been minimized.

@charlierudolph

This comment has been minimized.

@dougwilson
Copy link
Contributor

So I looked really hard at this issue for Express 4.x, and it can actually be fixed in path-to-regexp 0.1.x series. I'll look into issuing a PR there, but even then, the current 0.1.5 is buggy, so unless the buggyness is reverted or fixed, even if said PR is accepted, we still wouldn't be able to upgrade until the "index issue" is addressed.

@tunniclm
Copy link
Contributor

tunniclm commented Mar 1, 2016

Looking through the related PR (pillarjs/path-to-regexp#57), it looks like it would not be possible to fix this in 4.x in a back-compatible way. Is that correct? Or is it just a matter of fixing it in express rather than path-to-regexp?

@dougwilson
Copy link
Contributor

Sorry, I never updated this issue. Yes, that is correct in that we can't get it fixed in Express 4.x. If you want to take a stab at it, please go ahead! A fresh look never hurts, because it feels like something that should be possible to fix.

@tunniclm
Copy link
Contributor

tunniclm commented Mar 2, 2016

So if I understand correctly, to fix the original issue we want to prevent translation of * within the () part of a token definition so that it will work as expected within the regexp (instread of taking the special meaning of applying to the token).

However, we can't do this because we found an example that documents /files/:file(*) as a valid usage where the * applies to the token, and we don't want to break anyone that reasonably expects documented and working behaviour not be broken by a minor or patch release.

Is that a correct summary?

@blakeembrey
Copy link
Member

blakeembrey commented Mar 2, 2016

Yes.

Edit: It's possible people also relying on that feature now, so removing (and fixing this issue) would break. I know I've had at least one app in the future abusing the * feature within () and not just as (*).

@tunniclm
Copy link
Contributor

tunniclm commented Mar 2, 2016

The only reasonable way to make a back-compatible fix that I can think of would be to add an option you could enable to make Express replace *s inside the () with {0,} before sending it to path-to-regexp.

You could push that option up into a patch on path-to-regexp version 0.x instead if that sits better.

I'm happy to have a stab at that change if it's something that we think is desirable. There is a bit of a philosophy question there -- how keen are we on adding options that will be removed next major version to fix bugs that may cause breakage?

Otherwise, perhaps we should label this issue with something like "wontfix", "limitation" or "low-priority" to indicate we can't fix it in 4.x?

@dougwilson
Copy link
Contributor

The labels are accurate: it's a bug in 4.x, but wont be fixed until 5.0 due to limitations of the implementation we have in 4.x, for better or worse. We keep this issue open for two reasons:

  1. To remind us it needs to be addressed until there is a 5.0 release on npm with this addressed.
  2. Capture the conversation regarding an active bug for users to find when they have the same issue. They can also subscribe to this issue to know when it is fixed in a published version.

As for the potential solution above, any changes to building a regular expression need to land in our own dependency, otherwise we are not only defeating the purpose of owning our own dependencies, but we are cheating the community out of building complex applications, since they would have to copy and paste into their code instead of updating a dependency.

An option, to me, sounds crazy, because if the change would have to be behind a flag, then it sounds like it's not backwards compatible. The way the paths work is an intrinsic property of express, and so a flag to change this behavior at runtime means a community module can no longer know it's routes will just work on a major version of Express, rather they have to instruct users to change this flag or tough luck.

@tunniclm
Copy link
Contributor

tunniclm commented Mar 3, 2016

I wasn't thinking we would remove any labels or close the issue, I just wanted to add an extra label so it is easier to see the status/use in a query. Perhaps something like "fix-in-next-release" would capture it?

For the potential solution, yes, it was a bit of a stretch idea and certainly far from ideal and, on reflection, the existence of a simple workaround makes it unattractive.

I discounted other potential solutions (like special casing (*) or some subset of regexp-likes containing *) since I didn't find docs or tests that explained clearly how * should work inside (). So a reasonable user could form a conclusion about how it works (which would be backed up by the implementation) that would be broken by changing it at all by default.

It's a shame, because I think it is quite surprising behaviour.

@clakech
Copy link

clakech commented Feb 1, 2017

Thanks for documenting this in the issue, that helps a lot

What is the best workaround: using {0,} or (*) ?

both work but which one is a smaller technical debt ?

@dougwilson
Copy link
Contributor

Using {0,} should work the same with 5.x comes out while (*) will likely change meaning in 5, I believe.

@ttencate

This comment has been minimized.

@dougwilson

This comment has been minimized.

@ttencate

This comment has been minimized.

@rjmunro

This comment has been minimized.

@VAggrippino

This comment has been minimized.

@rjmunro

This comment has been minimized.

@VAggrippino

This comment has been minimized.

@rjmunro

This comment has been minimized.

@mregydev
Copy link

mregydev commented Nov 8, 2018

This issue still exist in express 5.x in spite not being exist in Path-to-RegExp

app.get('/:number(\\d*\\d*)', (request, response) => { response.send(request.params.number) })

It donot work with one digit parameter (http://localhost/2) but works with more than one digit parameters (http://localhost/22)

Thanks

@dougwilson
Copy link
Contributor

This is now resolved in 5.0.0-beta.1

@VevoLiang

This comment was marked as spam.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment