Download the InfoSphere® Big Match for Hadoop installation artifacts and run the installation
setup for the node on a machine that has internet access.
Before you begin
Download the Big Match installation artifacts from IBM® Passport Advantage®.
- In a web browser, navigate to Passport Advantage Online and sign in with an IBM ID that is
entitled to download InfoSphere Big Match for Hadoop. If you are not familiar with Passport
Advantage, you can watch a video tour on the home page that explains how to use the site.
- Download the part named IBM InfoSphere Big Match for Hadoop Edition V11.5.0.11
Multiplatform Multilingual eAssembly (CJ7LFEN), along with any other parts that you need
for your deployment.
Refer to the Download IBM InfoSphere Master Data Management support article to determine the
parts that you should download from Passport Advantage.
- Extract the files to a temporary folder on your system.
About this task
To install Big Match in offline mode, you must download the installation assets on any machine
that has internet access and then copy them to the Hadoop cluster.
Note: The Big Match online mode installation is no longer supported.
Procedure
- Extract the installation asset that you downloaded from Passport Advantage.
tar -xzvf IBM_INFOSPHERE_BIG_MATCH_FOR_HADO.tar.gz
The asset's contents are as follows:
- bigmatch_binaries/bigmatch-ambari-service-11.5.0.11-ambari-common.rpm
- bigmatch_binaries/bigmatch-ambari-service-11.5.0.11-hdp-3.1.rpm
- bigmatch_binaries/bigmatch-rpm-11.5.0.11.rpm
- bigmatch_binaries/BIGMATCH-GPG-KEY.public
- bigmatch_binaries/bigmatch-native-rpm-11.5.0.11-x86_64.rpm
- bigmatch_binaries/bigmatch-ambari-service-11.5.0.11-hdp-2.6.rpm
- bigmatch_binaries/bigmatch-native-rpm-11.5.0.11-ppc64le.rpm
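Before you continue, it can help to confirm that the extraction produced every file in the list above. The following is a minimal sketch, not part of the product: the function name check_bigmatch_assets and its two arguments (directory and version) are hypothetical, and the file names are taken from the listing for version 11.5.0.11.

```shell
# Hypothetical helper: report any expected Big Match file that is missing
# from a directory. Returns 0 when all files are present, 1 otherwise.
check_bigmatch_assets() {
  dir="$1"      # directory that holds the extracted files
  version="$2"  # Big Match version, for example 11.5.0.11
  missing=0
  for f in \
    "bigmatch-ambari-service-$version-ambari-common.rpm" \
    "bigmatch-ambari-service-$version-hdp-2.6.rpm" \
    "bigmatch-ambari-service-$version-hdp-3.1.rpm" \
    "bigmatch-rpm-$version.rpm" \
    "bigmatch-native-rpm-$version-x86_64.rpm" \
    "bigmatch-native-rpm-$version-ppc64le.rpm" \
    "BIGMATCH-GPG-KEY.public"
  do
    if [ ! -f "$dir/$f" ]; then
      echo "missing: $f"
      missing=1
    fi
  done
  return "$missing"
}
```

For example, check_bigmatch_assets bigmatch_binaries 11.5.0.11 prints one line for each missing file and returns a nonzero status if anything is absent.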
- Set up a local repository.
A local repository is required. In the following steps, the host that performs the repository
mirror function is called the repository server. If you do not have an additional Linux® host,
you can use one of the Hadoop management nodes. The repository server must be accessible over
the network by the Hadoop cluster. The repository server requires an HTTP web server.
Complete the following steps to set up a repository server by using a Linux host with an Apache
HTTP server.
- On the repository server, install the Apache HTTP server if it is not already installed.
- On the repository server, ensure that the createrepo package is installed.
- Ensure that there is network access from all nodes in your cluster to the repository server.
If data nodes are on a private network and the repository server is external to the Hadoop
cluster, you might need to configure iptables for IP forwarding and masquerading.
- On the repository server, create a directory for your Big Match repository, such as
<mirror web server document root>/repos/BIGMATCH. For example, for Apache httpd, the default
document root is /var/www/html, so the repository directory is /var/www/html/repos/BIGMATCH.
mkdir -p /var/www/html/repos/BIGMATCH
- When you extracted the installation assets, the RPM files were placed in a subdirectory
called bigmatch_binaries. Copy all of the extracted RPMs to the mirror web server location, the
<mirror web server document root>/repos/BIGMATCH directory.
cp BigMatch/packages/bigmatch_binaries/* /var/www/html/repos/BIGMATCH/
- Start the web server. If you use Apache httpd, start it by using either of the
following commands:
apachectl start
or
service httpd start
Ensure that any firewall settings allow inbound HTTP access from your cluster nodes to the mirror
web server.
- Test your local repository by browsing to the web directory:
http://<your.mirror.web.server>/repos/BIGMATCH
You should see all of the files that you copied to the repository server.
- On the repository server, run the createrepo command to initialize
the repository:
createrepo /var/www/html/repos/BIGMATCH
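After createrepo runs, the repository directory contains a repodata/repomd.xml index, so fetching that file from a cluster node is a quick way to confirm that the repository is both initialized and reachable. The following sketch assumes that curl is available on the node; the function names repomd_url and check_repo and the host name mirror.example.com are hypothetical.

```shell
# Hypothetical helper: build the URL of the metadata index that createrepo
# writes, tolerating one trailing slash on the repository URL.
repomd_url() {
  printf '%s/repodata/repomd.xml\n' "${1%/}"
}

# Hypothetical helper: return 0 if the metadata index can be downloaded.
# curl -f makes HTTP errors such as 404 produce a nonzero exit status.
check_repo() {
  curl -fsS -o /dev/null "$(repomd_url "$1")"
}

# Example call from a cluster node (the host name is a placeholder):
#   check_repo http://mirror.example.com/repos/BIGMATCH && echo "repository reachable"
```

A failure here usually points to a firewall rule, a web server that is not running, or a createrepo step that was skipped.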
- In the BigMatch/packages directory, find the RPMs to install on
the Ambari Server host of the Hadoop cluster:
bigmatch-ambari-service-<product-version>-<platform>.rpm
For example:
bigmatch-ambari-service-11.5.0.11-ambari-common.rpm
bigmatch-ambari-service-11.5.0.11-hdp-2.6.rpm
bigmatch-ambari-service-11.5.0.11-hdp-3.1.rpm
Then, copy the files to the Ambari Server host and install the RPMs by using the following
commands (replace <product-version> with the Big Match version):
sudo yum localinstall bigmatch-ambari-service-<product-version>-ambari-common.rpm
sudo yum localinstall bigmatch-ambari-service-<product-version>-hdp-2.6.rpm
sudo yum localinstall bigmatch-ambari-service-<product-version>-hdp-3.1.rpm
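The three commands differ only in the platform suffix, so they can be generated from the version number. The following is a sketch under the assumption that the RPMs were copied to the current directory on the Ambari Server host; the function names ambari_service_rpms and install_ambari_service_rpms are hypothetical, and the -y option, which answers yes to the yum confirmation prompt, is an addition that the original commands do not use.

```shell
# Hypothetical helper: print the Ambari service RPM file names for a
# given Big Match version, one per line.
ambari_service_rpms() {
  version="$1"
  for platform in ambari-common hdp-2.6 hdp-3.1; do
    echo "bigmatch-ambari-service-$version-$platform.rpm"
  done
}

# Hypothetical helper: install each RPM from the current directory and
# stop at the first failure. Defined here but not called automatically.
install_ambari_service_rpms() {
  for rpm_file in $(ambari_service_rpms "$1"); do
    sudo yum localinstall -y "$rpm_file" || return 1
  done
}

# Example: install_ambari_service_rpms 11.5.0.11
```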
- On the Ambari Server node, navigate to and edit the repoinfo.xml
file for the Big Match service.
Tip: In Big Match version 11.5.0.11 and later, repoinfo.xml is located in
/var/lib/ambari-server/resources/stacks/<stack_name>/<version_number>/services/BIGMATCH/repos/.
In version 11.5.0.10 and earlier, it is located in
/var/lib/ambari-server/resources/stacks/<stack_name>/<version_number>/repos/repoinfo.xml.
Ensure that the <baseurl> elements for the BIGMATCH-some_arch-build_id <repo> entries point to
your repository server. Remember, there will be multiple <repo> sections. Make sure that the URL
that you tested when you browsed to your local repository exactly matches the value in the
<baseurl> element.
For example, the repoinfo.xml file might look like the following after you change
http://ibm-open-platform.ibm.com/repos/Bigmatch/ to
http://<your.mirror.web.server>/repos/BIGMATCH:
<repo>
<baseurl>http://<your.mirror.web.server>/repos/BIGMATCH</baseurl>
<repoid>BIGMATCH-some_arch-build_id</repoid>
<reponame>BIGMATCH-some_arch-build_id</reponame>
</repo>
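If you prefer to script the change, the edit can be sketched with sed. This is an illustration only: the function names list_baseurls and replace_baseurl_prefix are hypothetical, the old URL is treated as a sed pattern (so a dot matches any character), and a .bak copy of the file is kept so that the change can be reverted. Review the result by hand, especially in versions where repoinfo.xml also holds <repo> sections for other services.

```shell
# Hypothetical helper: print every <baseurl> value in a repoinfo.xml file,
# one per line, with surrounding whitespace trimmed.
list_baseurls() {
  sed -n 's|.*<baseurl>[[:space:]]*\([^<[:space:]]*\)[[:space:]]*</baseurl>.*|\1|p' "$1"
}

# Hypothetical helper: replace an old URL with a new one throughout the
# file, keeping the original as <file>.bak.
replace_baseurl_prefix() {
  old="$1"
  new="$2"
  file="$3"
  sed -i.bak "s|$old|$new|g" "$file"
}

# Example (the host name is a placeholder):
#   replace_baseurl_prefix "http://ibm-open-platform.ibm.com/repos/Bigmatch/" \
#     "http://mirror.example.com/repos/BIGMATCH" repoinfo.xml
#   list_baseurls repoinfo.xml
```

Running list_baseurls after the change makes it easy to compare each value against the URL that you tested in the browser.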
- After the module is installed, restart the Ambari server.