Skip to content

Latest commit

 

History

History
129 lines (111 loc) · 5.99 KB

README.md

File metadata and controls

129 lines (111 loc) · 5.99 KB

css-to-xpath-converter

It's inspired by css2xpath, but it is too buggy.

This JavaScript version is ported from C# converter, which is intended to work with an HTML navigator inherited from System.Xml.XPath.XPathNavigator, but tests in Puppeteer and Crome show that generated XPathes also work in browsers. See Test results.

Important: the converter doesn't check validity neither of CSS selector nor of resulted XPath. So there may be cases when an application throw an error on parsing the XPath or the XPath isn't work as expected.
Although the converter is capture a lot of mistakes, but not all.

See online css-to-xpath-converter. It contains a lot of examples.

It allows using these CSS selectors:

Combinators

Selectors Description Remark
"+"
">"
"~"
"^" first child
"!" ancestors
"!^" last child
"!+" adjacent preceding sibling
"!>" parent
"!~" preceding sibling

Attribute selectors

Selectors Description Remark
"=" equals
"!=" not equals
"^=" starts with
"$=" ends with
"*=" contains within
"~=" contains exactly
" =" exactly or followed by a hyphen
[attr operator value i] to perform case-insensitive value comparison i or I

Pseudoclasses

Selectors Description Remark
":checked"
":contains()" text contains string
":disabled"
":empty"
":enabled"
":ends-with()" text ends with string
":eq()" equal to number same as ":nth()"
":first" first of selected elements
":first-child"
":first-of-type"
":gt()" select elements greater than number
":has()"
":has-ancestor()"
":has-parent()"
":has-sibling()"
":icontains()" text contains string ignore case
":iends-with()" text ends with string ignore case
":is()"
":istarts-with()" text starts with string ignore case
":last" last of selected elements
":last-child"
":last-of-type"
":limit()" select elements up to number
":lt()" select elements lesser than number
":not()"
":nth()" equal to number same as ":eq()"
":nth-child()"
":nth-last-child()"
":nth-of-type()"
":nth-last-of-type()"
":only-child"
":only-of-type"
":range()" select elements from smaller number to bigger number inclusive
":root" html element
":skip()" skip elements lesser than number
":skip-first"
":skip-last"
":starts-with()" text starts with string
":target" select elements with attribute 'href' starts with '#'
":text"

Usage:

const { xpath, css, warning, error } = toXPath(selector, options);
if (xpath) console.log(xpath);
else console.log(error);

Output object properties:

  • xpath - the generated XPath string or undefined if an error occur on parsing CSS selector
  • css - the normalized CSS selector string (stripped from unnecessary white spaces, comments)
  • warning - warning message or empty string
  • error - error message or empty string

Parameters:

  • selector - the input css selector string

  • options - the optional object :

    • axis - the XPath start axis. See online converter axes dropbox tooltips.

    • useClassName - this option change the XPath generation of attribute selector '[class operation value]'. See Class attribute non-standard and Class attribute standard. Non-standard approach is more flexible because it is dealing with individual classes instead of a whole className.

    • removeXPathSpaces - strips unnecessary space characters from the XPath (they are added for readability).

    • printError - the callback to send error message that the converter is detected, e.g. in online converter it is printed in the XPath editor.

    • debug - log an error message to the console (as the converter catches all errors internally, no error will be log to the console unless debug option is enable).

    • uppercaseLetters - custom uppercase letters string, using by converter to perform case-insensitive attribute value operations.
      It extends the default uppercase Latin alphabet. It must be compliant with lowercaseLetters.

    • lowercaseLetters - custom lowercase letters string, using by converter to perform case-insensitive attribute value operations.
      It extends the default lowercase Latin alphabet. It must be compliant with uppercaseLetters.

    The default options:

    const options = {
      axis : '//',
      useClassName : false,
      removeXPathSpaces : false,
      uppercaseLetters : '',
      lowercaseLetters : '',
      printError : (message) => {}
      debug : false
    };