Slide 1

Slide 1 text

2 December 2005 Introduction to Databases Advanced SQL Prof. Beat Signer Department of Computer Science Vrije Universiteit Brussel beatsigner.com

Slide 2

Slide 2 text

Beat Signer - Department of Computer Science - [email protected] 2 March 20, 2019 Context of Today's Lecture Access Methods System Buffers Authorisation Control Integrity Checker Command Processor Program Object Code DDL Compiler File Manager Buffer Manager Recovery Manager Scheduler Query Optimiser Transaction Manager Query Compiler Queries Catalogue Manager DML Preprocessor Database Schema Application Programs Database Manager Data Manager DBMS Programmers Users DB Admins Based on 'Components of a DBMS', Database Systems, T. Connolly and C. Begg, Addison-Wesley 2010 Data, Indices and System Catalogue

Slide 3

Slide 3 text

Beat Signer - Department of Computer Science - [email protected] 3 March 20, 2019 Authorisation ▪ Different types of authorisations (privileges) can be assigned to a user via DDL ▪ insert, read, update and delete privileges ▪ The creator of a new relation is automatically given all privileges on that relation grantPrivilege = "GRANT" , "ALL PRIVILEGES" , "TO" , ( "PUBLIC" | grantee ) | "GRANT" , privilege , { "," , privilege } ) , "ON" , ( table | view ) , { "," , ( table | view ) } , "TO" , ( "PUBLIC" | grantee ) , [ "WITH GRANT OPTION" ]; privilege = "INSERT" | "SELECT" | ( "UPDATE" , [ "(" , column , { "," , column } , ")" ] ) | "DELETE"; grantee = ( user | role ) , { "," , ( user | role ) };

Slide 4

Slide 4 text

Beat Signer - Department of Computer Science - [email protected] 4 March 20, 2019 Authorisation ... ▪ An update privilege may be limited to certain attributes only ▪ Roles can be used to group users GRANT ALL PRIVILEGES TO Alice, Bob; GRANT INSERT, UPDATE (amount, status) ON Order TO Eve WITH GRANT OPTION; createRole = "CREATE ROLE" , role; grantRole = "GRANT" , role , "TO" ( "PUBLIC" | grantee ); CREATE ROLE PowerUser; GRANT PowerUser TO Alice, Bob;

Slide 5

Slide 5 text

Beat Signer - Department of Computer Science - [email protected] 5 March 20, 2019 Authorisation ... ▪ Privileges can be passed to other users if the WITH GRANT OPTION has been specified ▪ Privileges and roles can be removed via the REVOKE command (similar to GRANT statement) ▪ Examples ▪ Note that the revocation of a privilege may have an effect on grants that have been passed to other users ▪ authorisation graph has to be inspected REVOKE INSERT ON Order FROM Eve; REVOKE PowerUser FROM Bob;

Slide 6

Slide 6 text

Beat Signer - Department of Computer Science - [email protected] 6 March 20, 2019 Authorisation Graph ▪ Privilege revocation examples ▪ remove privilege from user1 (g1 ) → user4 still has privilege via g24 ▪ remove privilege from user2 (g2 ) → user2 still has privilege via g23 ▪ remove privilege from user3 (g3 ) → user3 still has privilege via g23 ▪ remove privilege from user2 (g2 ) and user3 (g3 ) - g23 still exists but is no longer part of a path starting from the admin → user2 and user3 no longer have privilege → user5 no longer has privilege user1 user2 user3 user4 user5 admin g1 g2 g24 g3 g14 g25 g23

Slide 7

Slide 7 text

Beat Signer - Department of Computer Science - [email protected] 7 March 20, 2019 Working with SQL ▪ There are two main modes to work with SQL ▪ via generic query interfaces or SQL application interfaces ▪ note that standard SQL is not Turing complete ▪ Generic SQL interface ▪ command-line or web-based interface ▪ answer single questions (one query) ▪ interactive query evaluation ▪ SQL interface for applications ▪ sequence of queries, inserts, updates and deletes ▪ SQL statements that are embedded within a host language ▪ query results are processed and reused in non-declarative actions - I/O interfaces - human-computer interaction

Slide 8

Slide 8 text

Beat Signer - Department of Computer Science - [email protected] 8 March 20, 2019 Impedance Mismatch Problem ▪ Combination of SQL with a host language ▪ mix of declarative and procedural programming paradigms ▪ two completely different data models - relational model with sets and bags - no pointers, loops and branches in basic SQL - no NULL values in typical host languages ▪ different set of data types - many programming languages do not offer Date or Time data types ▪ Interfacing with SQL is not straightforward ▪ data has to be converted between the host language and SQL due to the impedance mismatch ▪ ~30% of the code and effort is used for this conversion! ▪ various libraries to simplify the mapping (e.g. Hibernate)

Slide 9

Slide 9 text

Beat Signer - Department of Computer Science - [email protected] 9 March 20, 2019 Impedance Mismatch Problem ... ▪ Why not simply use a single language? ▪ both languages have their domain-specific strengths /** * Transforms the Swiss Projection * coordinates (y, x, h') to the * ellipsoidic WGS84 coordinates (phi, * lambda, h) using and approach of * [U. Marti] based on the formulas of * [Bolliger, 1967]. * @param swissCoordinates the Swiss * coordinates to be transformed. * @return the WGS84 coordinates for * the specified Swiss coordinates. */ public static final WGS84Coordinates transformSwissCoordinatesToWGS84( SwissCoordinates swissCoordinates) { double yPrime = (swissCoordinates.getY() - 600000) / 1000000; double xPrime = (swissCoordinates.getX() - 200000) / 1000000; double xPrimePow2 = xPrime * xPrime; double xPrimePow3 = xPrimePow2 * xPrime; double yPrimePow2 = yPrime * yPrime; double yPrimePow3 = yPrimePow2 * ... Impedance Mismatch Application Code Database

Slide 10

Slide 10 text

Beat Signer - Department of Computer Science - [email protected] 10 March 20, 2019 SQL Programming Environments ▪ Embedded SQL (ESQL) ▪ integration of SQL in a host programming language ▪ e.g. interfacing with C through ESQL/C ▪ SQL Call-Level Interface (SQL/CLI) ▪ e.g. interfacing with Java through JDBC ▪ SQL Persistent Stored Modules (SQL/PSM) ▪ stored procedures ▪ Triggers

Slide 11

Slide 11 text

Beat Signer - Department of Computer Science - [email protected] 11 March 20, 2019 Database Applications ▪ Task sharing between client and server ▪ client does data presentation ▪ server does data management ▪ Where should we implement the application logic? ▪ on the client - ESQL, SQL/CLI, ODBC, JDBC, ... ▪ on the server - SQL/PSM, Stored Procedures, Triggers ▪ thin client vs. thick client

Slide 12

Slide 12 text

Beat Signer - Department of Computer Science - [email protected] 12 March 20, 2019 Example Database customerID name street postcode city orderID customerID cdID date amount status Customer CD Order name address Producer cdID title duration price year producer CREATE TABLE Customer ( customerID INTEGER CHECK (customerID > 0) PRIMARY KEY, name VARCHAR(30) NOT NULL, street VARCHAR(30) NOT NULL, postcode SMALLINT CHECK (postcode > 0), city VARCHAR(20) );

Slide 13

Slide 13 text

Beat Signer - Department of Computer Science - [email protected] 13 March 20, 2019 Example Database ... CREATE TABLE Producer ( name VARCHAR(40) PRIMARY KEY, address VARCHAR(30) ); CREATE TABLE Order ( orderID INTEGER CHECK (orderID > 0) PRIMARY KEY, customerID INTEGER, cdID INTEGER, date DATE, amount INTEGER, Status VARCHAR(20) NOT NULL DEFAULT 'open', UNIQUE (customerID, cdID, date), FOREIGN KEY (customerID) REFERENCES Customer(customerID) ON UPDATE CASCADE ON DELETE SET NULL, FOREIGN KEY (cdID) REFERENCES CD(cdID) ON UPDATE CASCADE );

Slide 14

Slide 14 text

Beat Signer - Department of Computer Science - [email protected] 14 March 20, 2019 Example Database ... CREATE TABLE CD ( cdID INTEGER PRIMARY KEY, title VARCHAR(30) NOT NULL, duration SMALLINT, price SMALLINT, year SMALLINT, producer VARCHAR(40), FOREIGN KEY (producer) REFERENCES Producer(name) );

Slide 15

Slide 15 text

Beat Signer - Department of Computer Science - [email protected] 15 March 20, 2019 Embedded SQL ▪ The expressiveness of SQL can be enhanced by embedding SQL statements into application code ▪ Use of a preprocessor ▪ transforms SQL statements into function calls in the host language ▪ type checks, syntax and semantic validation ▪ error handling ▪ Data interchange via shared variables ▪ shared variables are used in SQL statements as well as in statements of the host language ▪ A special SQLSTATE variable is used to connect the host language with the SQL execution engine

Slide 16

Slide 16 text

Beat Signer - Department of Computer Science - [email protected] 16 March 20, 2019 Embedded SQL Program Processing ▪ The preprocessor transforms a program with embedded SQL statements into the host language ▪ e.g. nsqlprep.exe for ESQL-C ▪ The compiler produces the linked program object code Host language with Embedded SQL Host language with function calls Program object code Embedded SQL preprocessor Host language compiler SQL library

Slide 17

Slide 17 text

Beat Signer - Department of Computer Science - [email protected] 17 March 20, 2019 Call Level Interface ▪ SQL library functions are called directly ▪ not tied to particular DBMS (drivers stored separately) ▪ Various implementations exist ▪ SQL/CLI for C ▪ Open Database Connectivity (ODBC) and its successor OLE DB ▪ Java™ Database Connectivity (JDBC) ▪ ActiveX® Data Objects (ADO)

Slide 18

Slide 18 text

Beat Signer - Department of Computer Science - [email protected] 18 March 20, 2019 Java Database Connectivity (JDBC) ▪ The following JDBC packages are part of Java SE 10 ▪ java.sql.* - basic JDBC functionality for J2SE ▪ javax.sql.* - supplemental extensions for J2EE ▪ The standardised Java API provides database connectivity through vendor-specific drivers ▪ Limited code portability among SQL platforms due to vendor-specific SQL dialects

Slide 19

Slide 19 text

Beat Signer - Department of Computer Science - [email protected] 19 March 20, 2019 Opening a JDBC Connection ▪ The JDBC URL varies for different drivers ▪ e.g. jdbc:microsoft:sqlserver for Microsoft SQL Server public Connection openConnection() { try { Connection conn = DriverManager.getConnection( "jdbc:microsoft:sqlserver://localhost:1433", "fred", "password"); return conn; } catch (SQLException e) { System.out.println("Opening JDBC Connection failed."); return null; } }

Slide 20

Slide 20 text

Beat Signer - Department of Computer Science - [email protected] 20 March 20, 2019 Statements ▪ There are three different kinds of statements ▪ Statement - general statement ▪ PreparedStatement - precompiled statement - more efficient if the same query is executed multiple times ▪ CallableStatement - used to call stored procedures ▪ Methods for executing a statement ▪ executeUpdate(String sql) ▪ executeQuery(String sql) ▪ No parameters are required for PreparedStatements

Slide 21

Slide 21 text

Beat Signer - Department of Computer Science - [email protected] 21 March 20, 2019 Statements ... void insertProducer(String name, String address) { try { Connection conn = this.openConnection(); Statement s = conn.createStatement(); s.executeUpdate("INSERT INTO Producer VALUES ('" + name + "', '" + address + "')"); } catch (SQLException e) { System.out.println("Inserting \"" + name + "\" failed."); } } public void insertProducer(String name, String address) { try { Connection conn = this.openConnection(); Statement s = conn.prepareStatement("INSERT INTO Producer VALUES ('" + name + "', " + address + ")"); s.executeUpdate(); } catch (SQLException e) { System.out.println("Inserting \"" + name + "\" failed."); } }

Slide 22

Slide 22 text

Beat Signer - Department of Computer Science - [email protected] 22 March 20, 2019 Result Set ▪ JDBC represents cursors as ResultSet ▪ API offers navigation methods on a ResultSet ▪ next(), previous() ▪ first(), last() ▪ absolute(int row), relative(int rows) ▪ API offers various get methods to fetch data as well as update methods for common SQL data types

Slide 23

Slide 23 text

Beat Signer - Department of Computer Science - [email protected] 23 March 20, 2019 Result Set Example ▪ Note that it would be easier to use the SQL AVG operator public float getAverageCDLength() { float result = 0.0; try { Connection conn = this.openConnection(); Statement s = conn.createStatement(); ResultSet set = s.executeQuery("SELECT length FROM CD"); int i = 0; while (set.next()) { result += set.getInt(1); i++; } return result/i; } catch (SQLException e) { System.out.println("Calculation of average length failed."); return 0; } }

Slide 24

Slide 24 text

Beat Signer - Department of Computer Science - [email protected] 24 March 20, 2019 Parameter Parsing ▪ Prepared statements can be parameterised ▪ flexibility to use different values in a given query ▪ A question mark (?) is used as a placeholder in the query string ▪ Various methods to insert data at runtime ▪ setString(int index, String s) ▪ setInt(int index, int i) ▪ setBoolean(int index, boolean b) ▪ setDate(int index, Date d) ▪ ...

Slide 25

Slide 25 text

Beat Signer - Department of Computer Science - [email protected] 25 March 20, 2019 Parameter Parsing Example public void insertProducers(List producers) { try { Connection conn = this.openConnection(); Statement s = conn.prepareStamement( "INSERT INTO Producer(name, address) VALUES(?, ?)"); for (Producer producer : producers) { s.setString(1, producer.getName()); s.setString(2, producer.getAddress()); s.executeUpdate(); } } catch (SQLException e) { System.out.println("Insertion of Producers failed."); } }

Slide 26

Slide 26 text

Beat Signer - Department of Computer Science - [email protected] 26 March 20, 2019 Special Datatypes and NULL Values ▪ Some SQL datatypes cannot be easily mapped to Java datatypes ▪ JDBC offers implementations for these types ▪ Clob, Blob, Date, Time, Timestamp ▪ Null values are handled via special methods ▪ wasNull() ▪ updateNull(int index) ▪ setNull(int index, int sqlType)

Slide 27

Slide 27 text

Beat Signer - Department of Computer Science - [email protected] 27 March 20, 2019 JDBC Summary ▪ Concepts of ESQL and JDBC are similar ▪ State of the art for Java database programming ▪ Generic JDBC-ODBC driver if no specific JDBC driver is available ▪ Java Data Objects (JDO) ▪ enables Java programmers to directly store their Java domain model instances in a database ▪ hides SQL from the programmer ▪ https://java.sun.com/javase/technologies/database/

Slide 28

Slide 28 text

Beat Signer - Department of Computer Science - [email protected] 28 March 20, 2019 Application Logic on the Server Side ▪ Stored Procedures ▪ PL/SQL - Oracle ▪ Transact-SQL - Microsoft ▪ standardised as SQL/PSM in SQL-99 - Persistent Stored Modules ▪ Triggers ▪ event driven execution of application logic ▪ User Defined Types ▪ standardised in SQL-99

Slide 29

Slide 29 text

Beat Signer - Department of Computer Science - [email protected] 29 March 20, 2019 SQL Persistent Stored Modules ▪ SQL is extended with constructs known from high-level programming languages ▪ assignments, branches, loops, subroutines and exceptions createProcedure = "CREATE PROCEDURE" , name , "(" , parameters , ")" , localDeclarations , procedureBody; createFunction = "CREATE FUNCTION" , name , "(" , parameters , ")" , "RETURNS" , type , localDeclarations , functionBody;

Slide 30

Slide 30 text

Beat Signer - Department of Computer Science - [email protected] 30 March 20, 2019 Parameters ▪ SQL/PSM defines three types for parameters ▪ IN - input only ▪ OUT - output only ▪ INOUT - both input and output ▪ The default mode is IN ▪ Procedure parameters can have any mode ▪ Function parameters can have IN mode only ▪ result given back only via the return value

Slide 31

Slide 31 text

Beat Signer - Department of Computer Science - [email protected] 31 March 20, 2019 Procedure Example ▪ Procedure parameters can have any mode ▪ DB administrators often create these type of procedures (for maintenance) CREATE PROCEDURE RelocateProducer( IN oldAddress VARCHAR(30), IN newAddress VARCHAR(30) ) UPDATE Producer SET address = newAddress WHERE address = oldAddress;

Slide 32

Slide 32 text

Beat Signer - Department of Computer Science - [email protected] 32 March 20, 2019 Function Example CREATE FUNCTION AverageCDLengthOfProducer( IN name VARCHAR(30), ) RETURNS INTEGER DECLARE result INTEGER; BEGIN SELECT AVG(length) INTO result FROM CD WHERE producer = name; RETURN result; END;

Slide 33

Slide 33 text

Beat Signer - Department of Computer Science - [email protected] 33 March 20, 2019 SQL/PSM Statements ▪ Procedure calls ▪ Function evaluation ▪ functions cannot be called ▪ functions are directly invoked as parts of expressions ▪ Return statement ▪ does not terminate the function ▪ additional RETURN statements may cause the return value to change after an initial return statement CALL name (arguments) RETURN expression

Slide 34

Slide 34 text

Beat Signer - Department of Computer Science - [email protected] 34 March 20, 2019 SQL/PSM Statements ... ▪ Local variable declaration ▪ Assignments ▪ NULL is a permissible expression ▪ Compound statement DECLARE name type SET variable = expression BEGIN ... END

Slide 35

Slide 35 text

Beat Signer - Department of Computer Science - [email protected] 35 March 20, 2019 Branch Statement ▪ If statement ▪ ELSEIFs and final ELSE are optional IF condition THEN statements ELSEIF condition THEN statements ELSE statements ENDIF

Slide 36

Slide 36 text

Beat Signer - Department of Computer Science - [email protected] 36 March 20, 2019 Branch Example CREATE FUNCTION IsExpensive( IN cdTitle CHAR(30) ) RETURNS BOOLEAN IF NOT EXISTS(SELECT * FROM CD WHERE title = cdTitle) THEN RETURN FALSE; ELSEIF 50 < (SELECT MAX(price) FROM CD WHERE title = cdTitle) THEN RETURN TRUE; ELSE RETURN FALSE; ENDIF;

Slide 37

Slide 37 text

Beat Signer - Department of Computer Science - [email protected] 37 March 20, 2019 Loop Statements ▪ Basic loop statement ▪ The loop can be terminated with LEAVE label ▪ the label identifies which loop has to be aborted label: LOOP statements END LOOP

Slide 38

Slide 38 text

Beat Signer - Department of Computer Science - [email protected] 38 March 20, 2019 Loop Statements ... ▪ While loop statement ▪ Repeat loop statement WHILE condition DO statements END WHILE REPEAT statements UNTIL condition END REPEAT

Slide 39

Slide 39 text

Beat Signer - Department of Computer Science - [email protected] 39 March 20, 2019 Loop Statements ... ▪ For loop statement ▪ Convenience statement to iterate over cursor ▪ open / close cursor ▪ check if more tuples to be fetched ▪ Name and cursor are needed for transformation into basic loop (by PSM interpreter) FOR name AS cursor CURSOR FOR query DO statements END FOR

Slide 40

Slide 40 text

Beat Signer - Department of Computer Science - [email protected] 40 March 20, 2019 Loop Example CREATE FUNCTION LengthVarianceOfProducer( IN name VARCHAR(40), ) RETURNS REAL DECLARE variance REAL; DECLARE mean REAL; DECLARE count INTEGER; BEGIN SET variance = 0.0; SET mean = 0.0; SET count = 0; FOR cdLoop AS cdCursor CURSOR FOR SELECT length FROM CD WHERE producer = name; DO SET count = count + 1; SET variance = variance + length * length; SET mean = mean + length; END FOR; SET mean = mean/count; RETURN variance/count – mean * mean; END;

Slide 41

Slide 41 text

Beat Signer - Department of Computer Science - [email protected] 41 March 20, 2019 SQL/PSM Summary ▪ Perform computation on the server side ▪ Implement functionality not contained in SQL ▪ recursion ▪ closure ▪ mathematical and statistical functions ▪ SQL in combination with SQL/PSM forms a Turing complete programming language ▪ Functionality for administrative tasks is often implemented as PSM

Slide 42

Slide 42 text

Beat Signer - Department of Computer Science - [email protected] 42 March 20, 2019 Transitive Closure (Recursion) ▪ Inconvenient to specify transitive closure using iteration ▪ Since SQL:1999 there is a form of recursion using the WITH RECURSIVE clause ▪ Example ▪ "Find all destinations that can be reached (directly or indirectly) from Zurich Airport" from to ZRH BRU ZRH JFK BRU CDG JFK BRU ... ... Flight

Slide 43

Slide 43 text

Beat Signer - Department of Computer Science - [email protected] 43 March 20, 2019 Transitive Closure (Recursion) ... ▪ Recursive view is a union of two subqueries ▪ base query ▪ recursive query making use of the recursive view - recursive query should be monotonic (e.g. no aggregation in recursive view) ▪ Repeat until no new tuples added and fixpoint is reached WITH RECURSIVE Connection(from, to) AS ( SELECT from, to FROM Flight UNION SELECT Flight.from, Connection.to FROM Flight, Connection WHERE Flight.to = Connection.from ) SELECT DISTINCT to FROM Connection WHERE Connection.from = 'ZRH';

Slide 44

Slide 44 text

Beat Signer - Department of Computer Science - [email protected] 44 March 20, 2019 Triggers ▪ A trigger is a statement that the DBMS executes as a side effect of a modification to the database ▪ The definition of a trigger follows the so-called event-condition-action (ECA) model ▪ event - specifies the signal that triggers the rule (e.g. insert, update or delete) ▪ condition - the condition that has to be satisfied for the execution of the action ▪ action - the action to be executed ▪ Triggers are a powerful tool for checks before/after an operation on a relation (e.g. for integrity constraints)

Slide 45

Slide 45 text

Beat Signer - Department of Computer Science - [email protected] 45 March 20, 2019 Triggers ... ▪ Example ▪ When data is restored from a backup or replicated, triggers often have to be disabled ▪ otherwise triggers might be executed multiple times ▪ The execution of a trigger can lead to a (infinite) cascade of other triggers that get fired ▪ often the length of such a trigger chain is limited by the system CREATE TRIGGER setNull BEFORE UPDATE ON Producer REFERENCING NEW ROW AS newRow FOR EACH ROW WHEN newRow.address = '' SET newRow.address = NULL;

Slide 46

Slide 46 text

Beat Signer - Department of Computer Science - [email protected] 46 March 20, 2019 Homework ▪ Study the following chapter of the Database System Concepts book ▪ chapter 5 - sections 5.1-5.4, 5.7 - Advanced SQL

Slide 47

Slide 47 text

Beat Signer - Department of Computer Science - [email protected] 47 March 20, 2019 Exercise 6 ▪ Structured Query Language (SQL) ▪

Slide 48

Slide 48 text

Beat Signer - Department of Computer Science - [email protected] 48 March 20, 2019 References ▪ A. Silberschatz, H. Korth and S. Sudarshan, Database System Concepts (Sixth Edition), McGraw-Hill, 2010 ▪ H. Garcia-Molina, J.D. Ullman and J. Widom, Database Systems: The Complete Book, Prentice Hall, 2002

Slide 49

Slide 49 text

2 December 2005 Next Lecture DBMS Architectures and Features