VoiceXML Review - Columns

Welcome to First Words, VoiceXML Review's column that teaches you about VoiceXML and how you can use it. We hope you enjoy the lesson.

Last month we talked about using the <transfer> tag to connect your callers to other services or people. This month, we're going to do a little call screening, and try out the <record> tag.

Of course, whenever a call takes place, you have access to telephony-related information about the call. This information includes the dialed number (session.telephone.dnis), the calling number (session.telephone.ani), if available, and possibly additional network information (User-to-user information, UUI, as session.telephone.uui, and Information digits as session.telephone.iidigits). This telephony-related information allows you to tailor your application based on who calling whom, where the call if made from, and so on. UUI can be used as required by the application, and the information digits provide useful information about the call.

The <record> tag allows you to record what the caller is saying, and then to make use of that recorded data. The <record> tag is used as a form item for collecting input within a form, and shares a number of characteristics with other form items such as <field>. The recorded data is available as the form item variable associated with the <record> tag.

This data can then be used as you would expect; it can be played back to a caller, and it can be submitted to a web server. The <record> tag can be useful in many types of applications, as I'm sure you can imagine. Some examples might include Voicemail systems, E-mail by phone, collecting general comments or requests, and so on.

So how do we use this wondrous capability that the <record> tag gives us? Here is a simple example.

<?xml version="1.0"?>
<vxml version="1.0">
  <form>
    <record name="recorded_message" type="audio/wav" maxtime="30s"
    dtmfterm="true">
      <prompt>
        Please record something. Press any key to stop recording.
      </prompt>
      <filled>
        <prompt>
          You have recorded your message. Here is what it sound like.
          <value expr="recorded_message" />
        </prompt>
      </filled>
    </record>
  </form>
</vxml>

This example will prompt the caller, and then record their input (up to thirty seconds worth) and then play it back to them as part of another prompt. If the caller wishes to terminate recording, they can press a DTMF key. Although it doesn't matter in this example, we have specified that the file should be saved in WAVE format, as indicated by the content type 'audio/wav'.

As with other form items, the <record> element contains other elements that define the behavior while collecting the data from the user. In this example, we have:

This set of attributes allows us to control the collection of our audio data. When the recording has been made, a number of shadow variables are defined:

<?xml version="1.0"?>
<vxml version="2.0">
    <form>
        <record beep="true" name="recorded_message" type="audio/wav" maxtime="30s"
        dtmfterm="true">
            <prompt>
               Please record something. Press any key to stop recording.
            </prompt>
            <noinput>
               You really should say something.
            </noinput>
            <filled>
                <prompt>
                    You have recorded your message. Here is what it sound like.
                    <value expr="recorded_message" />
                </prompt>
                <if cond="recorded_message$.maxtime == 'true' ">
                    <prompt>
                        You talk too much! Your message was truncated.
                    </prompt>
                </if>
                <submit next="/cgi-bin/record.pl" method="post" />
          </filled>
          <catch event="telephone.disconnect.hangup">
                <submit next="/cgi-bin/record.pl" method="post" />
          </catch>
       </record>
  </form>
</vxml>

The Record Tag

By Rob Marchand