Azure平台 对Twitter 推文关键字进行实时大数据分析

简介: Learn how to do real-time sentiment analysis of big data using HBase in an HDInsight (Hadoop) cluster.

Learn how to do real-time sentiment analysis of big data using HBase in an HDInsight (Hadoop) cluster.

Social web sites are one of the major driving forces for Big Data adoption. Public APIs provided by sites like Twitter are a useful source of data for analyzing and understanding popular trends. In this tutorial, you will develop a console streaming service application and an ASP.NET Web application to perform the following:

  • Get geo-tagged Tweets in real-time using the Twitter streaming API.
  • Evaluate the sentiment of these Tweets.
  • Store the sentiment information in HBase using the Microsoft HBase SDK.
  • Plot the real-time statistical results on Bing maps using an ASP.NET Web application. A visualization of the tweets will look something like this:

    hdinsight.hbase.twitter.sentiment.bing.map

    You will be able to query tweets with certain keywords to get a sense of the expressed opinion in tweets is positive, negative, or neutral.

A complete Visual Studio solution sample can be found at https://github.com/maxluk/tweet-sentiment.

In this article

Prerequisites

Before you begin this tutorial, you must have the following:

  • An HBase cluster in HDInsight. For instructions on cluster provision, see Get started using HBase with Hadoop in HDInsight. You will need the following data to go through the tutorial:

    CLUSTER PROPERTY DESCRIPTION
    HBase cluster name This is your HDInsight HBase cluster name. For example: https://myhbase.azurehdinsight.net/
    Cluster user name The Hadoop user account name. The default Hadoop username is admin.
    Cluster user password The Hadoop cluster user password.
  • A workstation with Visual Studio 2013 installed. For instructions, see Installing Visual Studio.

Create a Twitter application ID and secrets

The Twitter Streaming APIs use OAuth to authorize requests.

The first step to use OAuth is to create a new application on the Twitter Developer site.

To create Twitter application ID and secrets:

  1. Sign in to https://apps.twitter.com/.Click the Sign up now link if you don't have a Twitter account.
  2. Click Create New App.
  3. Enter Name, Description, Website. The Website field is not really used. It doesn't have to be a valid URL. The following table shows some sample values to use:

    FIELD VALUE
    Name MyHDInsightHBaseApp
    Description MyHDInsightHBaseApp
    Website http://www.myhdinsighthbaseapp.com
  4. Check Yes, I agree, and then click Create your Twitter application.

  5. Click the Permissions tab. The default permission is Read only. This is sufficient for this tutorial.

  6. Click the API Keys tab.

  7. Click Create my access token.

  8. Click Test OAuth in the upper right corner of the page.

  9. Write down API key, API secret, Access token, and Access token secret. You will need the values later in the tutorial.

    hdi.hbase.twitter.sentiment.twitter.app

Create a simple Twitter streaming service

Create a console application to get Tweets, calculate Tweet sentiment score and send the processed Tweet words to HBase.

To create the Visual Studio solution:

  1. Open Visual Studio.
  2. From the File menu, point to New, and then click Project.
  3. Type or select the following values:

    • Templates: Visual C#
    • Template: Console Application
    • Name: TweetSentimentStreaming
    • Location: C:\Tutorials
    • Solution name: TweetSentimentStreaming
  4. Click OK to continue.

To install Nuget packages and add SDK references:

  1. From the Tools menu, click Nuget Package Manager, and then click Package Manager Console. The console panel will open at the bottom of the page.
  2. Use the following commands to install the Tweetinvi package, which is used to access the Twitter API, and the Protobuf-net package, which is used to serialize and deserialize objects.

    Install-Package TweetinviAPI Install-Package protobuf-net 
  3. From Solution Explorer, right-click References, and then click Add Reference.

  4. In the left pane, expand Assemblies, and then click Framework.

  5. In the right pane, select the checkbox in front of System.Configuration, and then click OK.

To define the Tweeter streaming service class:

  1. From Solution explorer, right-click TweetSentimentStreaming, point to Add, and then click Class.
  2. In Name, type HBaseWriter, and then click Add.
  3. In HBaseWriter.cs, add the following using statements on the top of the file:

    using System.IO; using System.Threading; using System.Globalization; using Microsoft.HBase.Client; using Tweetinvi.Core.Interfaces; using org.apache.hadoop.hbase.rest.protobuf.generated;
  4. Inside HbaseWriter.cs, add a new class call DictionaryItem:

    public class DictionaryItem { public string Type { get; set; } public int Length { get; set; } public string Word { get; set; } public string Pos { get; set; } public string Stemmed { get; set; } public string Polarity { get; set; } }

    This class structure is used to parse the sentiment dictionary file. The data is used to calculate sentiment score for each Tweet.

  5. Inside the HBaseWriter class, define the following constants and variables:

    // HDinsight HBase cluster and HBase table information
    const string CLUSTERNAME = "https://<HBaseClusterName>.azurehdinsight.net/"; const string HADOOPUSERNAME = "<HadoopUserName>"; //the default name is "admin" const string HADOOPUSERPASSWORD = "<HaddopUserPassword>"; const string HBASETABLENAME = "tweets_by_words"; // Sentiment dictionary file and the punctuation characters const string DICTIONARYFILENAME = @"..\..\data\dictionary\dictionary.tsv"; private static char[] _punctuationChars = new[] { ' ', '!', '\"', '#', '$', '%', '&', '\'', '(', ')', '*', '+', ',', '-', '.', '/', //ascii 23--47 ':', ';', '<', '=', '>', '?', '@', '[', ']', '^', '_', '`', '{', '|', '}', '~' }; //ascii 58--64 + misc. // For writting to HBase HBaseClient client; // a sentiment dictionary for estimate sentiment. It is loaded from a physical file. Dictionary<string, DictionaryItem> dictionary; // use multithread write Thread writerThread; Queue<ITweet> queue = new Queue<ITweet>(); bool threadRunning = true;
  6. Set the constant values, including <HBaseClusterName>, <HadoopUserName>, and <HaddopUserPassword>. If you want to change the HBase table name, you must change the table name in the Web application accordingly.

    You will download and move the dictionary.tsv file to a specific folder later in the tutorial.

  7. Define the following functions inside the HBaseWriter class:

    // This function connects to HBase, loads the sentiment dictionary, and starts the thread for writting.
    public HBaseWriter() { ClusterCredentials credentials = new ClusterCredentials(new Uri(CLUSTERNAME), HADOOPUSERNAME, HADOOPUSERPASSWORD); client = new HBaseClient(credentials); // create the HBase table if it doesn't exist if (!client.ListTables().name.Contains(HBASETABLENAME)) { TableSchema tableSchema = new TableSchema(); tableSchema.name = HBASETABLENAME; tableSchema.columns.Add(new ColumnSchema { name = "d" }); client.CreateTable(tableSchema); Console.WriteLine("Table \"{0}\" is created.", HBASETABLENAME); } // Load sentiment dictionary from a file LoadDictionary(); // Start a thread for writting to HBase writerThread = new Thread(new ThreadStart(WriterThreadFunction)); writerThread.Start(); } ~HBaseWriter() { threadRunning = false; } // Enqueue the Tweets received public void WriteTweet(ITweet tweet) { lock (queue) { queue.Enqueue(tweet); } } // Load sentiment dictionary from a file private void LoadDictionary() { List<string> lines = File.ReadAllLines(DICTIONARYFILENAME).ToList(); var items = lines.Select(line => { var fields = line.Split('\t'); var pos = 0; return new DictionaryItem { Type = fields[pos++], Length = Convert.ToInt32(fields[pos++]), Word = fields[pos++], Pos = fields[pos++], Stemmed = fields[pos++], Polarity = fields[pos++] }; }); dictionary = new Dictionary<string, DictionaryItem>(); foreach (var item in items) { if (!dictionary.Keys.Contains(item.Word)) { dictionary.Add(item.Word, item); } } } // Calculate sentiment score private int CalcSentimentScore(string[] words) { Int32 total = 0; foreach (string word in words) { if (dictionary.Keys.Contains(word)) { switch (dictionary[word].Polarity) { case "negative": total -= 1; break; case "positive": total += 1; break; } } } if (total > 0) { return 1; } else if (total < 0) { return -1; } else { return 0; } } // Popular a CellSet object to be written into HBase private void CreateTweetByWordsCells(CellSet set, ITweet tweet) { // Split the Tweet into words string[] words = tweet.Text.ToLower().Split(_punctuationChars); // Calculate sentiment score base on the words int sentimentScore = CalcSentimentScore(words); var word_pairs = words.Take(words.Length - 1) .Select((word, idx) => string.Format("{0} {1}", word, words[idx + 1])); var all_words = words.Concat(word_pairs).ToList(); // For each word in the Tweet add a row to the HBase table foreach (string word in all_words) { string time_index = (ulong.MaxValue - (ulong)tweet.CreatedAt.ToBinary()).ToString().PadLeft(20) + tweet.IdStr; string key = word + "_" + time_index; // Create a row var row = new CellSet.Row { key = Encoding.UTF8.GetBytes(key) }; // Add columns to the row, including Tweet identifier, language, coordinator(if available), and sentiment var value = new Cell { column = Encoding.UTF8.GetBytes("d:id_str"), data = Encoding.UTF8.GetBytes(tweet.IdStr) }; row.values.Add(value); value = new Cell { column = Encoding.UTF8.GetBytes("d:lang"), data = Encoding.UTF8.GetBytes(tweet.Language.ToString()) }; row.values.Add(value); if (tweet.Coordinates != null) { var str = tweet.Coordinates.Longitude.ToString() + "," + tweet.Coordinates.Latitude.ToString(); value = new Cell { column = Encoding.UTF8.GetBytes("d:coor"), data = Encoding.UTF8.GetBytes(str) }; row.values.Add(value); } value = new Cell { column = Encoding.UTF8.GetBytes("d:sentiment"), data = Encoding.UTF8.GetBytes(sentimentScore.ToString()) }; row.values.Add(value); set.rows.Add(row); } } // Write a Tweet (CellSet) to HBase public void WriterThreadFunction() { try { while (threadRunning) { if (queue.Count > 0) { CellSet set = new CellSet(); lock (queue) { do { ITweet tweet = queue.Dequeue(); CreateTweetByWordsCells(set, tweet); } while (queue.Count > 0); } // Write the Tweet by words cell set to the HBase table client.StoreCells(HBASETABLENAME, set); Console.WriteLine("\tRows written: {0}", set.rows.Count); } Thread.Sleep(100); } } catch (Exception ex) { Console.WriteLine("Exception: " + ex.Message); } }

    The code provides the following functionality:

    • Connect to Hbase [ HBaseWriter() ]: Use the HBase SDK to create a ClusterCredentials object with the cluster URL and the Hadoop user credential, and then create a HBaseClient object using the ClusterCredentials object.
    • Create HBase table [ HBaseWriter() ]: The method call is HBaseClient.CreateTable().
    • Write to HBase table [ WriterThreadFunction() ]: The method call is HBaseClient.StoreCells().

To complete the Program.cs:

  1. From Solution Explorer, double-click Program.cs to open it.
  2. At the beginning of the file, add the following using statements:

    using System.Configuration; using System.Diagnostics; using Tweetinvi;
  3. Inside the Program class, define the following constants:

    const string TWITTERAPPACCESSTOKEN = "<TwitterApplicationAccessToken"; const string TWITTERAPPACCESSTOKENSECRET = "TwitterApplicationAccessTokenSecret"; const string TWITTERAPPAPIKEY = "TwitterApplicationAPIKey"; const string TWITTERAPPAPISECRET = "TwitterApplicationAPISecret";
  4. Set the constant values to match your Twitter application values.

  5. Modify the Main() function, so it looks like:

    static void Main(string[] args) { TwitterCredentials.SetCredentials(TWITTERAPPACCESSTOKEN, TWITTERAPPACCESSTOKENSECRET, TWITTERAPPAPIKEY, TWITTERAPPAPISECRET); Stream_FilteredStreamExample(); }
  6. Add the following function to the class:

    private static void Stream_FilteredStreamExample() { for (; ; ) { try { HBaseWriter hbase = new HBaseWriter(); var stream = Stream.CreateFilteredStream(); stream.AddLocation(Geo.GenerateLocation(-180, -90, 180, 90)); var tweetCount = 0; var timer = Stopwatch.StartNew(); stream.MatchingTweetReceived += (sender, args) => { tweetCount++; var tweet = args.Tweet; // Write Tweets to HBase hbase.WriteTweet(tweet); if (timer.ElapsedMilliseconds > 1000) { if (tweet.Coordinates != null) { Console.ForegroundColor = ConsoleColor.Green; Console.WriteLine("\n{0}: {1} {2}", tweet.Id, tweet.Language.ToString(), tweet.Text); Console.ForegroundColor = ConsoleColor.White; Console.WriteLine("\tLocation: {0}, {1}", tweet.Coordinates.Longitude, tweet.Coordinates.Latitude); } timer.Restart(); Console.WriteLine("\tTweets/sec: {0}", tweetCount); tweetCount = 0; } }; stream.StartStreamMatchingAllConditions(); } catch (Exception ex) { Console.WriteLine("Exception: {0}", ex.Message); } } }

To download the sentiment dictionary file:

  1. Browse to https://github.com/maxluk/tweet-sentiment.
  2. Click Download ZIP.
  3. Extract the file locally.
  4. Copy the file from ../tweet-sentiment/SimpleStreamingService/data/dictionary/dictionary.tsv.
  5. Paste the file to your solution under TweetSentimentStreaming/TweetSentimentStreaming/data/dictionary/dictionary.tsv.

To run the streaming service:

  1. From Visual Studio, press F5. The following is the console application screenshot:

    hdinsight.hbase.twitter.sentiment.streaming.service

  2. Keep the streaming console application running while you developing the Web application, So you have more data to use.

Create an Azure Website to visualize Twitter sentiment

In this section, you will create a ASP.NET MVC Web application to read the real-time sentiment data from HBase and plot the data on Bing maps.

To create a ASP.NET MVC Web application:

  1. Open Visual Studio.
  2. Click File, click New, and then click Project.
  3. Type or enter the following:

    • Template category: Visual C#/Web
    • Template: ASP.NET Web Application
    • Name: TweetSentimentWeb
    • Location: C:\Tutorials
  4. Click OK.

  5. In Select a template, click MVC.

  6. In Windows Azure, click Manage Subscriptions.

  7. From Manage Windows Azure Subscriptions, click Sign in.

  8. Enter your Azure credential. Your Azure subscription information will be shown on the Accounts tab.

  9. Click Close to close the Manage Windows Azure Subscriptions window.

  10. From New ASP.NET Project - TweetSentimentWeb, Click OK.

  11. From Configure Windows Azure Site Settings, select the Region that is closer to you. You don't need to specify a database server.

  12. Click OK.

To install Nuget packages:

  1. From the Tools menu, click Nuget Package Manager, and then click Package Manager Console. The console panel is opened at the bottom of the page.
  2. Use the following command to install the Protobuf-net package, which is used to serialize and deserialize objects.

    Install-Package protobuf-net 

To add HBaseReader class:

  1. From Solution Explorer, expand TweetSentiment.
  2. Right-click Models, click Add, and then click Class.
  3. In Name, enter HBaseReader.cs, and then click Add.
  4. Replace the code with the following:

    using System;
    using System.Collections.Generic; using System.Linq; using System.Web; using System.Configuration; using System.Threading.Tasks; using System.Text; using Microsoft.HBase.Client; using org.apache.hadoop.hbase.rest.protobuf.generated; namespace TweetSentimentWeb.Models { public class HBaseReader { // For reading Tweet sentiment data from HDInsight HBase HBaseClient client; // HDinsight HBase cluster and HBase table information const string CLUSTERNAME = "<HBaseClusterName>"; const string HADOOPUSERNAME = "<HBaseClusterHadoopUserName>" const string HADOOPUSERPASSWORD = "<HBaseCluserUserPassword>"; const string HBASETABLENAME = "tweets_by_words"; // The constructor public HBaseReader() { ClusterCredentials creds = new ClusterCredentials( new Uri(CLUSTERNAME), HADOOPUSERNAME, HADOOPUSERPASSWORD); client = new HBaseClient(creds); } // Query Tweets sentiment data from the HBase table asynchronously  public async Task<IEnumerable<Tweet>> QueryTweetsByKeywordAsync(string keyword) { List<Tweet> list = new List<Tweet>(); // Demonstrate Filtering the data from the past 6 hours the row key string timeIndex = (ulong.MaxValue - (ulong)DateTime.UtcNow.Subtract(new TimeSpan(6, 0, 0)).ToBinary()).ToString().PadLeft(20); string startRow = keyword + "_" + timeIndex; string endRow = keyword + "|"; Scanner scanSettings = new Scanner { batch = 100000, startRow = Encoding.UTF8.GetBytes(startRow), endRow = Encoding.UTF8.GetBytes(endRow) }; // Make async scan call ScannerInformation scannerInfo = await client.CreateScannerAsync(HBASETABLENAME, scanSettings); CellSet next; while ((next = await client.ScannerGetNextAsync(scannerInfo)) != null) { foreach (CellSet.Row row in next.rows) { // find the cell with string pattern "d:coor" var coordinates = row.values.Find(c => Encoding.UTF8.GetString(c.column) == "d:coor"); if (coordinates != null) { string[] lonlat = Encoding.UTF8.GetString(coordinates.data).Split(','); var sentimentField = row.values.Find(c => Encoding.UTF8.GetString(c.column) == "d:sentiment"); Int32 sentiment = 0; if (sentimentField != null) { sentiment = Convert.ToInt32(Encoding.UTF8.GetString(sentimentField.data)); } list.Add(new Tweet { Longtitude = Convert.ToDouble(lonlat[0]), Latitude = Convert.ToDouble(lonlat[1]), Sentiment = sentiment }); } if (coordinates != null) { string[] lonlat = Encoding.UTF8.GetString(coordinates.data).Split(','); } } } return list; } } public class Tweet { public string IdStr { get; set; } public string Text { get; set; } public string Lang { get; set; } public double Longtitude { get; set; } public double Latitude { get; set; } public int Sentiment { get; set; } } }
  5. Inside the HBaseReader class, change the constant values:

    • CLUSTERNAME: The HBase cluster name. For example, https://.azurehdinsight.net/.
    • HADOOPUSERNAME: The HBase cluster Hadoop user username. The default name is admin.
    • HADOOPUSERPASSWORD: The HBase cluster Hadoop user password.
    • HBASETABLENAME = "tweets_by_words";

    The HBase table name is "tweets_by_words". The values must match the values you sent in the streaming service, so that the Web application reads the data from the same HBase table.

To add TweetsController controller:

  1. From Solution Explorer, expand TweetSentimentWeb.
  2. Right-click Controllers, click Add, and then click Controller.
  3. Click Web API 2 Controller - Empty, and then click Add.
  4. In Controller name, type TweetsController, and then click Add.
  5. From Solution Explorer, double-click TweetsController.cs to open the file.
  6. Modify the file, so it looks like the following::

    using System;
    using System.Collections.Generic; using System.Linq; using System.Net; using System.Net.Http; using System.Web.Http; using System.Threading.Tasks; using TweetSentimentWeb.Models; namespace TweetSentimentWeb.Controllers { public class TweetsController : ApiController { HBaseReader hbase = new HBaseReader(); public async Task<IEnumerable<Tweet>> GetTweetsByQuery(string query) { return await hbase.QueryTweetsByKeywordAsync(query); } } }

To add heatmap.js

  1. From Solution Explorer, expand TweetSentimentWeb.
  2. Right-click Scripts, click Add, click JavaScript File.
  3. In Item name, enter heatmap.js.
  4. Copy and paste the following code into the file. The code was written by Alastair Aitchison. For more information, seehttp://alastaira.wordpress.com/2011/04/15/bing-maps-ajax-v7-heatmap-library/.

    /*******************************************************************************
    * Author: Alastair Aitchison
    * Website: http://alastaira.wordpress.com
    * Date: 15th April 2011
    * 
    * Description: 
    * This JavaScript file provides an algorithm that can be used to add a heatmap
    * overlay on a Bing Maps v7 control. The intensity and temperature palette
    * of the heatmap are designed to be easily customisable.
    *
    * Requirements:
    * The heatmap layer itself is created dynamically on the client-side using
    * the HTML5 <canvas> element, and therefore requires a browser that supports
    * this element. It has been tested on IE9, Firefox 3.6/4 and 
    * Chrome 10 browsers. If you can confirm whether it works on other browsers or
    * not, I'd love to hear from you!
    
    * Usage:
    * The HeatMapLayer constructor requires:
    * - A reference to a map object
    * - An array or Microsoft.Maps.Location items
    * - Optional parameters to customise the appearance of the layer
    *  (Radius,, Unit, Intensity, and ColourGradient), and a callback function
    *
    */
    
    var HeatMapLayer = function (map, locations, options) { /* Private Properties */ var _map = map, _canvas, _temperaturemap, _locations = [], _viewchangestarthandler, _viewchangeendhandler; // Set default options var _options = { // Opacity at the centre of each heat point intensity: 0.5, // Affected radius of each heat point radius: 1000, // Whether the radius is an absolute pixel value or meters unit: 'meters', // Colour temperature gradient of the map colourgradient: { "0.00": 'rgba(255,0,255,20)', // Magenta "0.25": 'rgba(0,0,255,40)', // Blue "0.50": 'rgba(0,255,0,80)', // Green "0.75": 'rgba(255,255,0,120)', // Yellow "1.00": 'rgba(255,0,0,150)' // Red }, // Callback function to be fired after heatmap layer has been redrawn  callback: null }; /* Private Methods */ function _init() { var _mapDiv = _map.getRootElement(); if (_mapDiv.childNodes.length >= 3 && _mapDiv.childNodes[2].childNodes.length >= 2) { // Create the canvas element _canvas = document.createElement('canvas'); _canvas.style.position = 'relative'; var container = document.createElement('div'); container.style.position = 'absolute'; container.style.left = '0px'; container.style.top = '0px'; container.appendChild(_canvas); _mapDiv.childNodes[2].childNodes[1].appendChild(container); // Override defaults with any options passed in the constructor _setOptions(options); // Load array of location data _setPoints(locations); // Create a colour gradient from the suppied colourstops _temperaturemap = _createColourGradient(_options.colourgradient); // Wire up the event handler to redraw heatmap canvas _viewchangestarthandler = Microsoft.Maps.Events.addHandler(_map, 'viewchangestart', _clearHeatMap); _viewchangeendhandler = Microsoft.Maps.Events.addHandler(_map, 'viewchangeend', _createHeatMap); _createHeatMap(); delete _init; } else { setTimeout(_init, 100); } } // Resets the heat map function _clearHeatMap() { var ctx = _canvas.getContext("2d"); ctx.clearRect(0, 0, _canvas.width, _canvas.height); } // Creates a colour gradient from supplied colour stops on initialisation function _createColourGradient(colourstops) { var ctx = document.createElement('canvas').getContext('2d'); var grd = ctx.createLinearGradient(0, 0, 256, 0); for (var c in colourstops) { grd.addColorStop(c, colourstops[c]); } ctx.fillStyle = grd; ctx.fillRect(0, 0, 256, 1); return ctx.getImageData(0, 0, 256, 1).data; } // Applies a colour gradient to the intensity map function _colouriseHeatMap() { var ctx = _canvas.getContext("2d"); var dat = ctx.getImageData(0, 0, _canvas.width, _canvas.height); var pix = dat.data; // pix is a CanvasPixelArray containing height x width x 4 bytes of data (RGBA) for (var p = 0, len = pix.length; p < len;) { var a = pix[p + 3] * 4; // get the alpha of this pixel if (a != 0) { // If there is any data to plot pix[p] = _temperaturemap[a]; // set the red value of the gradient that corresponds to this alpha pix[p + 1] = _temperaturemap[a + 1]; //set the green value based on alpha pix[p + 2] = _temperaturemap[a + 2]; //set the blue value based on alpha } p += 4; // Move on to the next pixel } ctx.putImageData(dat, 0, 0); } // Sets any options passed in function _setOptions(options) { for (attrname in options) { _options[attrname] = options[attrname]; } } // Sets the heatmap points from an array of Microsoft.Maps.Locations function _setPoints(locations) { _locations = locations; } // Main method to draw the heatmap function _createHeatMap() { // Ensure the canvas matches the current dimensions of the map // This also has the effect of resetting the canvas _canvas.height = _map.getHeight(); _canvas.width = _map.getWidth(); _canvas.style.top = -_canvas.height / 2 + 'px'; _canvas.style.left = -_canvas.width / 2 + 'px'; // Calculate the pixel radius of each heatpoint at the current map zoom if (_options.unit == "pixels") { radiusInPixel = _options.radius; } else { radiusInPixel = _options.radius / _map.getMetersPerPixel(); } var ctx = _canvas.getContext("2d"); // Convert lat/long to pixel location var pixlocs = _map.tryLocationToPixel(_locations, Microsoft.Maps.PixelReference.control); var shadow = 'rgba(0, 0, 0, ' + _options.intensity + ')'; var mapWidth = 256 * Math.pow(2, _map.getZoom()); // Create the Intensity Map by looping through each location for (var i = 0, len = pixlocs.length; i < len; i++) { var x = pixlocs[i].x; var y = pixlocs[i].y; if (x < 0) { x += mapWidth * Math.ceil(Math.abs(x / mapWidth)); } // Create radial gradient centred on this point var grd = ctx.createRadialGradient(x, y, 0, x, y, radiusInPixel); grd.addColorStop(0.0, shadow); grd.addColorStop(1.0, 'transparent'); // Draw the heatpoint onto the canvas ctx.fillStyle = grd; ctx.fillRect(x - radiusInPixel, y - radiusInPixel, 2 * radiusInPixel, 2 * radiusInPixel); } // Apply the specified colour gradient to the intensity map _colouriseHeatMap(); // Call the callback function, if specified if (_options.callback) { _options.callback(); } } /* Public Methods */ this.Show = function () { if (_canvas) { _canvas.style.display = ''; } }; this.Hide = function () { if (_canvas) { _canvas.style.display = 'none'; } }; // Sets options for intensity, radius, colourgradient etc. this.SetOptions = function (options) { _setOptions(options); } // Sets an array of Microsoft.Maps.Locations from which the heatmap is created this.SetPoints = function (locations) { // Reset the existing heatmap layer _clearHeatMap(); // Pass in the new set of locations _setPoints(locations); // Recreate the layer _createHeatMap(); } // Removes the heatmap layer from the DOM this.Remove = function () { _canvas.parentNode.parentNode.removeChild(_canvas.parentNode); if (_viewchangestarthandler) { Microsoft.Maps.Events.removeHandler(_viewchangestarthandler); } if (_viewchangeendhandler) { Microsoft.Maps.Events.removeHandler(_viewchangeendhandler); } _locations = null; _temperaturemap = null; _canvas = null; _options = null; _viewchangestarthandler = null; _viewchangeendhandler = null; } // Call the initialisation routine _init(); }; // Call the Module Loaded method Microsoft.Maps.moduleLoaded('HeatMapModule');

To add tweetStream.js:

  1. From Solution Explorer, expand TweetSentimentWeb.
  2. Right-click Scripts, click Add, click JavaScript File.
  3. In Item name, enter twitterStream.js.
  4. Copy and paste the following code into the file:

    var liveTweetsPos = []; var liveTweets = []; var liveTweetsNeg = []; var map; var heatmap; var heatmapNeg; var heatmapPos; function initialize() { // Initialize the map var options = { credentials: "AvFJTZPZv8l3gF8VC3Y7BPBd0r7LKo8dqKG02EAlqg9WAi0M7la6zSIT-HwkMQbx", center: new Microsoft.Maps.Location(23.0, 8.0), mapTypeId: Microsoft.Maps.MapTypeId.ordnanceSurvey, labelOverlay: Microsoft.Maps.LabelOverlay.hidden, zoom: 2.5 }; var map = new Microsoft.Maps.Map(document.getElementById('map_canvas'), options); // Heatmap options for positive, neutral and negative layers var heatmapOptions = { // Opacity at the centre of each heat point intensity: 0.5, // Affected radius of each heat point radius: 15, // Whether the radius is an absolute pixel value or meters unit: 'pixels' }; var heatmapPosOptions = { // Opacity at the centre of each heat point intensity: 0.5, // Affected radius of each heat point radius: 15, // Whether the radius is an absolute pixel value or meters unit: 'pixels', colourgradient: { 0.0: 'rgba(0, 255, 255, 0)', 0.1: 'rgba(0, 255, 255, 1)', 0.2: 'rgba(0, 255, 191, 1)', 0.3: 'rgba(0, 255, 127, 1)', 0.4: 'rgba(0, 255, 63, 1)', 0.5: 'rgba(0, 127, 0, 1)', 0.7: 'rgba(0, 159, 0, 1)', 0.8: 'rgba(0, 191, 0, 1)', 0.9: 'rgba(0, 223, 0, 1)', 1.0: 'rgba(0, 255, 0, 1)' } }; var heatmapNegOptions = { // Opacity at the centre of each heat point intensity: 0.5, // Affected radius of each heat point radius: 15, // Whether the radius is an absolute pixel value or meters unit: 'pixels', colourgradient: { 0.0: 'rgba(0, 255, 255, 0)', 0.1: 'rgba(0, 255, 255, 1)', 0.2: 'rgba(0, 191, 255, 1)', 0.3: 'rgba(0, 127, 255, 1)', 0.4: 'rgba(0, 63, 255, 1)', 0.5: 'rgba(0, 0, 127, 1)', 0.7: 'rgba(0, 0, 159, 1)', 0.8: 'rgba(0, 0, 191, 1)', 0.9: 'rgba(0, 0, 223, 1)', 1.0: 'rgba(0, 0, 255, 1)' } }; // Register and load the Client Side HeatMap Module Microsoft.Maps.registerModule("HeatMapModule", "scripts/heatmap.js"); Microsoft.Maps.loadModule("HeatMapModule", { callback: function () { // Create heatmap layers for positive, neutral and negative tweets heatmapPos = new HeatMapLayer(map, liveTweetsPos, heatmapPosOptions); heatmap = new HeatMapLayer(map, liveTweets, heatmapOptions); heatmapNeg = new HeatMapLayer(map, liveTweetsNeg, heatmapNegOptions); } }); $("#searchbox").val("xbox"); $("#searchBtn").click(onsearch); $("#positiveBtn").click(onPositiveBtn); $("#negativeBtn").click(onNegativeBtn); $("#neutralBtn").click(onNeutralBtn); $("#neutralBtn").button("toggle"); } function onsearch() { var uri = 'api/tweets?query='; var query = $('#searchbox').val(); $.getJSON(uri + query) .done(function (data) { liveTweetsPos = []; liveTweets = []; liveTweetsNeg = []; // On success, 'data' contains a list of tweets. $.each(data, function (key, item) { addTweet(item); }); if (!$("#neutralBtn").hasClass('active')) { $("#neutralBtn").button("toggle"); } onNeutralBtn(); }) .fail(function (jqXHR, textStatus, err) { $('#statustext').text('Error: ' + err); }); } function addTweet(item) { //Add tweet to the heat map arrays. var tweetLocation = new Microsoft.Maps.Location(item.Latitude, item.Longtitude); if (item.Sentiment > 0) { liveTweetsPos.push(tweetLocation); } else if (item.Sentiment < 0) { liveTweetsNeg.push(tweetLocation); } else { liveTweets.push(tweetLocation); } } function onPositiveBtn() { if ($("#neutralBtn").hasClass('active')) { $("#neutralBtn").button("toggle"); } if ($("#negativeBtn").hasClass('active')) { $("#negativeBtn").button("toggle"); } heatmapPos.SetPoints(liveTweetsPos); heatmapPos.Show(); heatmapNeg.Hide(); heatmap.Hide(); $('#statustext').text('Tweets: ' + liveTweetsPos.length + " " + getPosNegRatio()); } function onNeutralBtn() { if ($("#positiveBtn").hasClass('active')) { $("#positiveBtn").button("toggle"); } if ($("#negativeBtn").hasClass('active')) { $("#negativeBtn").button("toggle"); } heatmap.SetPoints(liveTweets); heatmap.Show(); heatmapNeg.Hide(); heatmapPos.Hide(); $('#statustext').text('Tweets: ' + liveTweets.length + " " + getPosNegRatio()); } function onNegativeBtn() { if ($("#positiveBtn").hasClass('active')) { $("#positiveBtn").button("toggle"); } if ($("#neutralBtn").hasClass('active')) { $("#neutralBtn").button("toggle"); } heatmapNeg.SetPoints(liveTweetsNeg); heatmapNeg.Show(); heatmap.Hide();; heatmapPos.Hide();; $('#statustext').text('Tweets: ' + liveTweetsNeg.length + "\t" + getPosNegRatio()); } function getPosNegRatio() { if (liveTweetsNeg.length == 0) { return ""; } else { var ratio = liveTweetsPos.length / liveTweetsNeg.length; var str = parseFloat(Math.round(ratio * 10) / 10).toFixed(1); return "Positive/Negative Ratio: " + str; } }

To modify the layout.cshtml:

  1. From Solution Explorer, expand TweetSentimentWeb, expand Views, expand Shared, and then double-click _Layout.cshtml.
  2. Replace the content with the following:

    <!DOCTYPE html>
    <html>
    <head> <meta charset="utf-8" /> <meta name="viewport" content="width=device-width, initial-scale=1.0"> <title>@ViewBag.Title</title> @Styles.Render("~/Content/css") @Scripts.Render("~/bundles/modernizr") <!-- Bing Maps --> <script type="text/javascript" src="http://ecn.dev.virtualearth.net/mapcontrol/mapcontrol.ashx?v=7.0&mkt=en-gb"></script> <!-- Spatial Dashboard JavaScript --> <script src="~/Scripts/twitterStream.js" type="text/javascript"></script> </head> <body onload="initialize()"> <div class="navbar navbar-inverse navbar-fixed-top"> <div class="container"> <div class="navbar-header"> <button type="button" class="navbar-toggle" data-toggle="collapse" data-target=".navbar-collapse"> <span class="icon-bar"></span> <span class="icon-bar"></span> <span class="icon-bar"></span> </button> </div> <div class="navbar-collapse collapse"> <div class="row"> <ul class="nav navbar-nav col-lg-5"> <li class="col-lg-12"> <div class="navbar-form"> <input id="searchbox" type="search" class="form-control"> <button type="button" id="searchBtn" class="btn btn-primary">Go</button> </div> </li> </ul> <ul class="nav navbar-nav col-lg-7"> <li> <div class="navbar-form"> <div class="btn-group" data-toggle="buttons-radio"> <button type="button" id="positiveBtn" class="btn btn-primary">Positive</button> <button type="button" id="neutralBtn" class="btn btn-primary">Neutral</button> <button type="button" id="negativeBtn" class="btn btn-primary">Negative</button> </div> </div> </li> <li><span id="statustext" class="navbar-text"></span></li> </ul> </div> </div> </div> </div> <div class="map_container"> @RenderBody() </div> @Scripts.Render("~/bundles/jquery") @Scripts.Render("~/bundles/bootstrap") @RenderSection("scripts", required: false) </body> </html>

To modify the Index.cshtml

  1. From Solution Explorer, expand TweetSentimentWeb, expand Views, expand Home, and then double-click Index.cshtml.
  2. Replace the content with the following:

    @{
        ViewBag.Title = "Tweet Sentiment"; } <div class="map_container"> <div id="map_canvas"/> </div>

To modify the site.css file:

  1. From Solution Explorer, expand TweetSentimentWeb, expand Content, and then double-click Site.css.
  2. Append the following code to the file.

    /* make container, and thus map, 100% width */
    .map_container { width: 100%; height: 100%; } #map_canvas{ height:100%; } #tweets{ position: absolute; top: 60px; left: 75px; z-index:1000; font-size: 30px; }

To modify the global.asax file:

  1. From Solution Explorer, expand TweetSentimentWeb, and then double-click Global.asax.
  2. Add the following using statement:

    using System.Web.Http;
  3. Add the following lines inside the Application_Start() function:

    // Register API routes
    GlobalConfiguration.Configure(WebApiConfig.Register);

    Modify the registration of the API routes to make Web API controller work inside of the MVC application.

To run the Web application:

  1. Verify the streaming service console application is still running. So you can see the real-time changes.
  2. Press F5 to run the web application:

    hdinsight.hbase.twitter.sentiment.bing.map

  3. In the text box, enter a keyword, and then click Go. Depending on the data collected in the HBase table, some keywords might not be found. Try some common keywords, such as "love", "xbox", "playstation" and so on.

  4. Toggle among Positive, Neutral, and Negative to compare sentiment on the subject.

  5. Let the streaming service running for another hour, and then search the same keyword, and compare the results.

Optionally, you can deploy the application to an Azure Web site. For instructions, see Get started with Azure Web Sites and ASP.NET.

Next Steps

In this tutorial we have learned how to get Tweets, analyze the sentiment of Tweets, save the sentiment data to HBase, and present the real-time Twitter sentiment data to Bing maps. To learn more, see:

相关实践学习
lindorm多模间数据无缝流转
展现了Lindorm多模融合能力——用kafka API写入,无缝流转在各引擎内进行数据存储和计算的实验。
云数据库HBase版使用教程
&nbsp; 相关的阿里云产品:云数据库 HBase 版 面向大数据领域的一站式NoSQL服务,100%兼容开源HBase并深度扩展,支持海量数据下的实时存储、高并发吞吐、轻SQL分析、全文检索、时序时空查询等能力,是风控、推荐、广告、物联网、车联网、Feeds流、数据大屏等场景首选数据库,是为淘宝、支付宝、菜鸟等众多阿里核心业务提供关键支撑的数据库。 了解产品详情:&nbsp;https://cn.aliyun.com/product/hbase &nbsp; ------------------------------------------------------------------------- 阿里云数据库体验:数据库上云实战 开发者云会免费提供一台带自建MySQL的源数据库&nbsp;ECS 实例和一台目标数据库&nbsp;RDS实例。跟着指引,您可以一步步实现将ECS自建数据库迁移到目标数据库RDS。 点击下方链接,领取免费ECS&amp;RDS资源,30分钟完成数据库上云实战!https://developer.aliyun.com/adc/scenario/51eefbd1894e42f6bb9acacadd3f9121?spm=a2c6h.13788135.J_3257954370.9.4ba85f24utseFl
相关文章
|
8天前
|
SQL 存储 分布式计算
ODPS技术架构深度剖析与实战指南——从零开始掌握阿里巴巴大数据处理平台的核心要义与应用技巧
【10月更文挑战第9天】ODPS是阿里巴巴推出的大数据处理平台,支持海量数据的存储与计算,适用于数据仓库、数据挖掘等场景。其核心组件涵盖数据存储、计算引擎、任务调度、资源管理和用户界面,确保数据处理的稳定、安全与高效。通过创建项目、上传数据、编写SQL或MapReduce程序,用户可轻松完成复杂的数据处理任务。示例展示了如何使用ODPS SQL查询每个用户的最早登录时间。
29 1
|
13天前
|
SQL 消息中间件 分布式计算
大数据-124 - Flink State 01篇 状态原理和原理剖析:状态类型 执行分析
大数据-124 - Flink State 01篇 状态原理和原理剖析:状态类型 执行分析
50 5
|
5天前
|
机器学习/深度学习 监控 搜索推荐
电商平台如何精准抓住你的心?揭秘大数据背后的神秘推荐系统!
【10月更文挑战第12天】在信息爆炸时代,数据驱动决策成为企业优化决策的关键方法。本文以某大型电商平台的商品推荐系统为例,介绍其通过收集用户行为数据,经过预处理、特征工程、模型选择与训练、评估优化及部署监控等步骤,实现个性化商品推荐,提升用户体验和销售额的过程。
19 1
|
8天前
|
存储 SQL 分布式计算
湖仓一体架构深度解析:构建企业级数据管理与分析的新基石
【10月更文挑战第7天】湖仓一体架构深度解析:构建企业级数据管理与分析的新基石
12 1
|
13天前
|
消息中间件 druid 大数据
大数据-153 Apache Druid 案例 从 Kafka 中加载数据并分析(二)
大数据-153 Apache Druid 案例 从 Kafka 中加载数据并分析(二)
21 2
|
13天前
|
消息中间件 分布式计算 druid
大数据-153 Apache Druid 案例 从 Kafka 中加载数据并分析(一)
大数据-153 Apache Druid 案例 从 Kafka 中加载数据并分析(一)
37 1
|
11天前
|
SQL 分布式计算 大数据
大数据平台的毕业设计01:Hadoop与离线分析
大数据平台的毕业设计01:Hadoop与离线分析
|
13天前
|
分布式计算 Hadoop 数据挖掘
6个常用大数据分析工具集锦
6个常用大数据分析工具集锦
31 0
|
12天前
|
存储 机器学习/深度学习 分布式计算
大数据技术——解锁数据的力量,引领未来趋势
【10月更文挑战第5天】大数据技术——解锁数据的力量,引领未来趋势
|
13天前
|
分布式计算 关系型数据库 MySQL
大数据-88 Spark 集群 案例学习 Spark Scala 案例 SuperWordCount 计算结果数据写入MySQL
大数据-88 Spark 集群 案例学习 Spark Scala 案例 SuperWordCount 计算结果数据写入MySQL
38 3