SQL - Embedded SQL Programming
===============================

- In this section, we will look at a different paradigm
  for writing programs that incorporate SQL queries.

- The main distinction between this model and others is that the
  program is written in a host language like C, but contains
  constructs that are foreign to the host language.

- One thing you will notice that it requires programmers to work on
  very low level details of communication with the database. 
  
- To be able to compile these programs, we must first precompile using
  a special program, which will rewrite the program code by replacing
  pieces of it.

- Once precompilation is finished, we compile the remaining code.

- Embedded SQL, ESQL is an industry standard language.   

Overview
------------

- Embedded SQL is an SQL standard for writing a program in a host
  language (like C) with SQL statements starting with the string:

  ::

     EXEC SQL 

  and ending with semicolon (;).
  
- In addition, all variables to be used by the program as input/output
  to a query must be declared within a declare section.

- Often type conversion for preliminary data types between the
  programming language and SQL is done by hand.

- Proc in Oracle, ECPG in Postgresql implements the C embeddings for
  SQL.


- Note that the following notes are based on Oracle embedded SQL
  language (slight differences are possible for postgresql)

- Example program:
  

  ::

     #include <stdio.h>
     exec sql include sqlca;

     char user_prompt[] = "Please enter username and password:  ";
     char cid_prompt[] = "Please enter customer ID:  ";

     int main()
     {
         exec sql begin declare section;       /* declare SQL host variables    */
             char cust_id[5];
             char cust_name[14];
             float cust_discnt;                  /* host var for discnt value    */
             char user_name[20];     
         exec sql end declare section;
    
         exec sql whenever sqlerror goto report_error; /* error trap condition     */
         exec sql whenever not found goto notfound; /* not found condition      */
    
         exec sql unix:postgresql://csc4380.cs.rpi.edu/sibel AS myconnection USER :user_name;
         /* ORACLE format: connect  */
    
         while (prompt(cid_prompt, 1, cust_id, 4) >= 0) {  
             exec sql select cname, discnt 
                     into :cust_name, :cust_discnt   /* retrieve cname, discnt   */
                     from customers where cid = :cust_id;
             exec sql commit work;                     /* release read lock on row */
            
             printf("CUSTOMER'S NAME IS  %s AND DISCNT IS  %5.1f\n",
                  cust_name, cust_discnt);            /* NOTE, (:) not used here  */
             continue;                                 
         }
     }	   
     
ESQL
-------
- Each ESQL statement starts with EXEC SQL keyword and ends with a
  semicolon ;
  
- A pre-compiler will scan a program file and only read the statements
  enclosed within EXEC SQL statements and disregard everything else.

- SQLCA is a specific data structure for storing status codes of all SQL operations

  ::

     /* always have this for error handling*/
     exec sql include sqlca ;             

Connections
----------------

- To be able to perform any operations, we must open a connection to the database. 

  ::

     EXEC SQL CONNECT TO target [AS connection-name] [USER user-name];
     
- Many connection can be opened in a program, but generally one
  connection per database is sufficient.
  
- Different databases can be used in a single program.
  
- Close all connections before the program exists:

  ::

     EXEC SQL DISCONNECT [connection];
     
- Change between multiple open connections with:

  ::

     EXEC SQL SET CONNECTION connection-name;
     
Variables in ESQL
------------------

- All variables that will be used in an SQL statement must be declared
  using an ESQL declaration and data type

  ::
     
     EXEC SQL BEGIN DECLARE SECTION ;
     VARCHAR    e_name[30], username[30] ;
     INTEGER     e_ssn, e_dept_id ;
     EXEC SQL END DECLARE SECTION ;
     
- You can use almost any SQL command in ESQL as long as proper input
  to these commands are provided in the form of program variables.
  
Executing SQL commands
-------------------------

- Suppose we want to find the name of an employee given his/her SSN
  (input by the user of the program):
  
  ::

     EXEC SQL select name, dept_id into :e_name, :e_dept_id
     from employee
     where ssn = :e_ssn ;

  - Program variables are preceded by ":", i.e. :e_ssn.

- Read the value of the variable “e_ssn” and execute the SQL statement
    using this value, store the returned values of columns "name" and
    "dept_id" in the program variables "e_name" and "e_dept_id".
    
- Compare the above query with the expression below. What is the difference?

  ::

     EXEC SQL select name, dept_id 
     from employee  where ssn = e_ssn ;

     
Executing SQL commands
-----------------------

- We are able to write:

  ::

     EXEC SQL select name, dept_id into :e_name, :e_dept_id
              from employee
              where ssn = :e_ssn ;

- Since this query returns a single tuple. For this tuple, we read the
  returned values.

- We will see how to handle queries that return multiple tuples in a minute.
  
Dealing with Strings
---------------------

- There is a mismatch between the definition of a string in Oracle and in C/C++. 
- In C, the end of a string is identified by the null character
  '\0'. Hence, "Sibel" would be stored as characters
  'S','i','b','e','l','\0'.
  
- In Oracle, the length of a string is stored together with the string
  and there is no special end of string character.
  
- If you convert a data string from Oracle to C, you must pad it with
  '\0' manually!
  
- The data type VARCHAR e_name[30] is translated by the pre-compiler
  to the following structure:
  
  ::
     
     struct {
         unsigned short len
         unsigned char arr[30]
     } e_name ;

- Putting the pieces together:

  ::

     strcpy(username.arr, “Sibel Adali") ;
     username.len = strlen(“Sibel Adali") ;
     strcpy(passwd.arr, “tweety-bird") ;
     passwd.len = strlen(“tweety-bird") ;
     exec sql 	connect :username  identified by :passwd ;
     scanf(“%d", &e_ssn) ;
     exec sql 	select name, dept_id into :e_name, :e_dept_id
                from employee where ssn = :e_ssn ;
     e_name.arr[e_name.len] = '\0' ;   /* so can use string in C*/
     printf(“%s", e_name.arr) ;
     exec sql commit work ;  /* make any changes permanent */
     exec sql disconnect ;     /* disconnect from the database */
     
Status Processing
--------------------

- SQL Communications area is a data structure that contains information about

  - Error codes (might be more detailed than SQLSTATE)
  - Warning flags 
  - Event information 
  - Rows-processed count 
  - Diagnostics for all processed SQL statements.
    
- Included in the program using

  ::

     EXEC SQL INCLUDE SQLCA; or #include <sqlca.h>
     
- It is possible to get the full text of error messages and other
  detailed status information.
  
- Whenever an SQL statement is executed, its status is returned in a
  variable named ``"SQLSTATE"``
  
- This variable must be defined in the variable section, but it is
  populated with values automatically

  ::

     EXEC SQL BEGIN DECLARE SECTION;
       char    SQLSTATE[6] ;
     EXEC SQL END DECLARE SECTION;
     
- Different errors and conditions have values that might be vendor specific.
  
Status processing
-------------------

- ``sqlca`` covers both warnings and errors. If multiple warnings or
  errors occur during the execution of a statement, then sqlca will
  only contain information about the last one.
  
- If no error occurred in the last SQL statement, sqlca.sqlcode will
  be 0 and sqlca.sqlstate will be "00000".
  
- If a warning or error occurred, then sqlca.sqlcode will be negative
  and sqlca.sqlstate will be different from "00000".
   
- If the last SQL statement was successful, then sqlca.sqlerrd[1]
  contains the OID of the processed row, if applicable, and
  sqlca.sqlerrd[2] contains the number of processed or returned rows,
  if applicable to the command.
  
- The code can be checked after each statement and error handling code
  can be written

  - Commit, rollback
  - Exit program, etc.

  ::
     
     if (strcmp(SQLSTATE, "000000") != 0)
          rollback ;
	  
- It is also possible to use trap conditions that remain active
  throughout the program.

  ::
     
     EXEC SQL WHENEVER <condition> <action> ;
     
  - Conditions: ``SQLERROR``, ``SQLWARNING``, ``NOT FOUND``
  - Actions: ``DO function``, ``DO break``, ``GOTO label``,
    ``CONTINUE``, ``STOP``
    
- Because WHENEVER is a declarative statement, its scope is
  positional, not logical. That is, it tests all executable SQL
  statements that physically follow it in the source file, not in the
  flow of program logic.
  
- A WHENEVER directive stays in effect until superseded by another
  WHENEVER directive checking for the same condition.
  
Transactions
-----------------

- Transactions start with the logically start with the first SQL
  statement and end with either a COMMIT or ROLLBACK statement
  
- It is possible to set boundaries of transactions with the SQL statement:

  ::

     BEGIN ;
     SET TRANSACTION READ ONLY
         ISOLATION LEVEL READ COMMITTED
         DIAGNOSTICS SIZE 6 ;
	 
- Diagnostics size is the number of exception conditions that can be
  described at one time in the status.
  
- READ ONLY, READ/WRITE transactions allow the programmer to define
  the type of the transaction
  
- Isolation level allows the programmer to define the desired level of
  consistency
  
ESQL - Cursor Operations
--------------------------
- Declare a cursor using a regular SQL query (no "into").

  ::
       
     EXEC SQL DECLARE emps_dept CURSOR FOR
           select ssn, name from employee
	   where dept_id = :e_dept_id ;

- Open a cursor: means the corresponding SQL query is executed, the
  results are written to a file (or a data structure) and the cursor
  is pointing to the first row.

  ::

     EXEC SQL OPEN emps_dept ;

- Read the current row pointed to by the cursor using "fetch". At the
  end of fetch, the cursor is moved to point to the next tuple.

  ::

     EXEC SQL FETCH emps_dept INTO :e_ssn, :e_name ;

- How do we know when we reach the end of a cursor?

  - Check the "sqlcode" to see if the end of a cursor is reached (its
    expected value depends on the system).

    ::

       if (sqlca.sqlcode == -1) { … }

- Error handling statements

  ::

     EXEC SQL WHENEVER NOT FOUND {}

Cursors and snapshots
-----------------------

- If a cursor is declared as "INSENSITIVE", the contents of the cursor
  is computed once when the cursor is opened, and remains the same
  until the cursor is closed, even if the underlying data tables
  change during the this time.


  ::

     DECLARE cursor-name CURSOR INSENSITIVE CURSOR FOR table-expression

- This type of cursor is a snapshot of the database, a view of it at a
  specific time.


  ::
     
     DECLARE cursor_name [INSENSITIVE][SCROLL] CURSOR FOR
       table_expression
       [ORDER BY order-item-comma-list]
       [ FOR [READ ONLY | UPDATE | OF column-commalist] ]

Cursors for update
--------------------

- If a cursor is declared for update, then updates can be performed on
  the current tuple.

  ::

     DECLARE CURSOR cursor-name CURSOR FOR table-expression
                 FOR UPDATE OF column-list

     UPDATE table-name SET assignment-list 
                WHERE CURRENT OF cursor-name

     DELETE FROM table-name WHERE CURRENT OF cursor-name

- For these updates to have an effect, the cursor must not be
  INSENSITIVE.

Constraints
---------------

- When constraints are violated, they cause an exception (or sqlerror)
  to be thrown.

- When are constraints violated?

  - If constraint checking for a constraint is immediate, as soon as
    an SQL statement causes the constraint to become false, it is
    rolled back.

  - If a constraint is defined to be deferrable, then the constraint
    is not checked until a transaction tries to commit. Then, if it is
    violated, the whole transaction is rolled back.

    ::

       CONSTRAINT name CHECK … DEFERRABLE

       
Dynamic SQL
-------------

- In Dynamic SQL, embedded SQL statements are created on the fly using
  strings!

- these strings are fed to an exec sql statement

  ::

     exec sql execute immediate :sql_string

- Since dynamic SQL statements are not known to the pre-compiler at
  compile/initiation time, they must be optimized at run time!

- Create a query once using a prepare statement and run it multiple
  times using the execute statement.

  ::

     strcopy(sqltext.arr, "delete from employee where ssn = ?") ;
     sqltext.len=str.len(sqltext.arr) ;
     exec sql prepare del_emp from :sqltext ;
     exec sql execute del_emp using :cust_id ;

SQLDA
------------

- When we execute a dynamic SQL statement, we do not know which
  columns will be returned and how many columns will be returned.

- The SQLDA descriptor definition allows us to find the number of
  columns and the value for each column.

  ::
     
     exec sql include sqlda ;
     exec sql declare sel_curs cursor for sel_emps ;
     exec sql prepare sel_emps from :sqltext ;
     exec sql describe sel_emps into sqlda ;
     
     exec sql open sel_curs ;
     exec sql fetch sel_curs using descriptor sqlda ;