xml-motor ~ what,why,how this xml-parsing rubygem

Slide 1

Slide 1 text

xmlmotor What it is : slide#2 Why you should use it : slide#3-6 How to use it : slide#7-12 AbhishekKr http://www.twitter.com/aBionic http://github.com/abhishekkr

Slide 2

Slide 2 text

Late 2011, started a new rubygem project for parsing xml, html. @Rubygems: http://rubygems.org/gems/xml-motor @GitHub : https://github.com/abhishekkr/rubygem_xml_motor Just created it to test out my work at compact, quick & easy xmlparsing algorithm... can see that @Slideshare: http://www.slideshare.net/AbhishekKr/xmlmotor So, currently this is a nonnative, completely independent lessthan250 rubyLOC available as a simple rubygem to be required and use in an easy freehand notation (like 'div.img') and match with any/multiple node attributes (like 'id=”a1”' or ['type=”color”', 'name=”white”']).

Slide 3

Slide 3 text

Current Features ● Has a single method access to parse require xml nodes from content or file. ● Use it only if you are gonna parse that xmlcontent once. ● For using same xmlcontent more than once, follow the 3way step mentioned in examples on end slides. ● It doesn't depend on presence of any other system library, purely nonnative. ● It parses broken or corrupted xml/html content correctly, just for the content it have. ● Can parse results on looking for nodenames, attributes of node or both.

Slide 4

Slide 4 text

Uses freefreehand notation to retrieve xml nodes. If your xml looks like, '... ABC CBA ... XYZ XYYZ ... ' and you look for 'book.author', then, you'll get back ['CBA', 'XY', 'YZ']; What that means is the childnode could be at any depth in the parentnode. Default return mode is without the tags, there is a switch to get the nodes.

Slide 5

Slide 5 text

To filter your nodes on the basis of attributes, single or multiple attributes can be provided. These attribute searches can be combined up with freehand node name searches. Readme (a bit weird, have to loosen it up): https://raw.github.com/abhishekkr/rubygem_xm l_motor/master/README

Slide 6

Slide 6 text

Features To Come Work on making it more performance efficient. Limit over resultnodes retrieved from start/end of matching nodes. Multinode attributebased filter for a hierarchical node search. Add more common CSS Selector style, capability is already present using attribute based search... just need to add a mapping method.

Slide 7

Slide 7 text

USAGE code we are going to try: https://github.com/abhishekkr/axml-motor/tree/master/ruby/examples

Slide 8

Slide 8 text

say, you have an xml file 'dummy.xml', with data as non-native compact easy

Slide 9

Slide 9 text

its available at rubygems.org, install it as $ gem install xmlmotor include it in your ruby code, #!/usr/bin/env ruby require 'xmlmotor' get the XML Filename and/or XML data available fyl = File.join(File.expand_path (File.dirname __FILE__),'dummy.xml') xml = File.open(fyl,'r'){|fr| fr.read }

Slide 10

Slide 10 text

One-time XML-Parsing directly from file XMLMotor.get_node_from_file (fyl, 'ummy.mmy', 'class="sys"') Result: ["nonnative", "\n compact\n "] One-time XML-Parsing directly from content XMLMotor.get_node_from_content (xml, 'dummy.my', 'class="usage"') Result: ["easy"]

Slide 11

Slide 11 text

Way to go for XML-Parsing for xml node searches xsplit = XMLMotor.splitter xml xtags = XMLMotor.indexify xsplit [] just normal node name based freehand notation to search: XMLMotor.xmldata (xsplit, xtags, 'dummy.my') Result: ["compact", "easy"] [] searching for values of required nodes filtered by attribute: XMLMotor.xmldata (xsplit, xtags, nil, 'class="usage"') Result: ["easy"]

Slide 12

Slide 12 text

[] searching for values of required nodes filtered by freehand tag-name notation & attribute: XMLMotor.xmldata(xsplit, xtags, 'dummy.my', 'class="usage"') Result: ["easy"] [] searching for values of required nodes filtered by freehand tag-name notation & multiple attributes: XMLMotor.xmldata(xsplit, xtags, 'dummy.my', ['class="sys"', 'id="mem"']) Result: ["compact"]